FUNGAL GENES REQUIRED FOR NORMAL GROWTH AND DEVELOPMENT

The invention relates to nucleic acid sequences isolated from Ashbya gossypii that 
encode proteins essential for fungal growth. The invention also includes the methods of 
using these proteins as pesticide targets, particularly fungicide targets, based on the 
essentiality of the gene for normal growth and development. The invention is also useful as 
a screening assay to identify inhibitors that are potential pesticides, particularly fungicides. 

The phytopathogenic fungus Ashbya gossypii is a filamentously growing 
ascomycete that was first isolated as a plant pathogen in tropical and sub-tropical regions. 
It infects the seed capsule of cotton plants and has also been isolated from tomatoes and 
citrus fruits. The infection of the seed capsule is caused by transmission of A. gossypii 
mycelium pieces or spores by stinging-sucking insects and causes a disease called 
stigmatomycosis. Presently, A. gossypii represents the most compact eukaryotic genome, 
compared to genome sizes of 12.5 Mb for S. cerevisiae (Chu et al., 1986), 31.0 Mb for 
Aspergillus nidulans (Brody and Carbon, 1989) and 47.0 Mb for Neurospora crassa (Orbach 
et al., 1988). 

A. gossypii is classified systematically in the Endomycetales, belonging to the family 
Spermophthoraceae. This classification is based on the observation that the spores that 
develop in hyphal compartments called sporangia look like ascospores, which are defined 
as end products of meiosis. 

Since Ashbya gossypii is a filamentous ascomycete, and is capable of growing only 
by filamentous (hyphal) growth, fungal targets found in this model organism are predictive 
of targets which will be found in other pathogens, the vast majority of which grow in a 
filamentous fashion. 

DEFINITIONS 

For clarity, certain terms used in the specification are defined and presented as follows: 
Chimeric: is used to indicate that a DNA sequence, such as a vector or a gene, is 
comprised of more than one DNA sequence of distinct origin which are fused together by 
recombinant DNA techniques resulting in a DNA sequence which does not occur naturally, 
and which particularly does not occur in the plant to be transformed. 

Co-factor: natural reactant, such as an organic molecule or a metal ion, required in an 
enzyme-catalyzed reaction. A co-factor is e.g. NAD(P), riboflavin (including FAD and FMN), 
folate, molybdopterin, thiamin, biotin, lipoic acid, pantothenic acid and coenzyme A, 
S-adenosylmethionine, pyridoxal phosphate, ubiquinone, menaquinone. Optionally, a 
co-factor can be regenerated and reused. 

Enzyme activity: means herein the ability of an enzyme to catalyze the conversion of a 
substrate into a product. A substrate for the enzyme comprises the natural substrate of the 
enzyme but also comprises analogues of the natural substrate which can also be converted 
by the enzyme into a product or into an analogue of a product. The activity of the enzyme is 
measured for example by determining the amount of product in the reaction after a certain 
period of time, or by determining the amount of substrate remaining in the reaction mixture 
after a certain period of time. The activity of the enzyme is also measured by determining 
the amount of an unused co-factor of the reaction remaining in the reaction mixture after a 
certain period of time or by determining the amount of used co-factor in the reaction mixture 
after a certain period of time. The activity of the enzyme is also measured by determining 
the amount of a donor of free energy or energy-rich molecule (e.g. ATP, 
phosphoenolpyruvate, acetyl phosphate or phosphocreatine) remaining in the reaction 
mixture after a certain period of time or by determining the amount of a used donor of free 
energy or energy-rich molecule (e.g. ADP, pyruvate, acetate or creatine) in the reaction 
mixture after a certain period of time. 

Expression: refers to the transcription and/or translation of an endogenous gene or a 
transgene in plants. In the case of antisense constructs, for example, expression may refer 
to the transcription of the antisense DNA only. 

Gene: refers to a coding sequence and associated regulatory sequences wherein the 
coding sequence is transcribed into RNA such as mRNA, rRNA, tRNA, snRNA, sense RNA 
or antisense RNA. Examples of regulatory sequences are promoter sequences, 5' and 3' 
untranslated sequences and termination sequences. Further elements that may be present 
are, for example, introns. 

Heterologous DNA Sequence: a DNA sequence not naturally associated with a host 
cell into which it is introduced, including non-naturally occurring multiple copies of a 
naturally occurring DNA sequence. 

Homologous DNA Sequence: a DNA sequence naturally associated with a host cell 
into which it is introduced. 

Isogenic: plants which are genetically identical, except that they may differ by the 
presence or absence of a transgene. 

Isolated: in the context of the present invention, an isolated DNA molecule or an 
isolated enzyme is a DNA molecule or enzyme that, by the hand of man, exists apart from 
its native environment and is therefore not a product of nature. An isolated DNA molecule 
or enzyme may exist in a purified form or may exist in a non-native environment such as, for 
example, a transgenic host cell. 

Mature protein: protein which is normally targeted to a cellular organelle, such as a 
chloroplast, and from which the transit peptide has been removed. 

Minimal Promoter: promoter elements, particularly a TATA element, that are inactive 
or that have greatly reduced promoter activity in the absence of upstream activation. In the 
presence of a suitable transcription factor, the minimal promoter functions to permit 
transcription. 

Modified Enzyme Activity: enzyme activity different from that which naturally occurs in 
a plant (i.e. enzyme activity that occurs naturally in the absence of direct or indirect 
manipulation of such activity by man), which is tolerant to inhibitors that inhibit the naturally 
occurring enzyme activity. 

Recombinant DNA molecule: a combination of DNA sequences that are joined 
together using recombinant DNA technology. 

Recombinant DNA technology: procedures used to join together DNA sequences as 
described, for example, in Sambrook et al., 1989, Cold Spring Harbor, NY: Cold Spring 
Harbor Laboratory Press. 

Significant Increase: an increase in enzymatic activity that is larger than the margin of 
error inherent in the measurement technique, preferably an increase by about 2-fold or 
greater of the activity of the wild-type enzyme in the presence of the inhibitor, more 
preferably an increase by about 5-fold or greater, and most preferably an increase by about 
10-fold or greater. 

Significantly less: means that the amount of a product of an enzymatic reaction is 
reduced by more than the margin of error inherent in the measurement technique, preferably a 
decrease by about 2-fold or greater of the activity of the wild-type enzyme in the absence of 
the inhibitor, more preferably a decrease by about 5-fold or greater, and most preferably 
a decrease by about 10-fold or greater. 

In its broadest sense, the term "substantially similar", when used herein with respect 
to a nucleotide sequence, means a nucleotide sequence corresponding to a reference 
nucleotide sequence, wherein the corresponding sequence encodes a polypeptide having 
substantially the same structure and function as the polypeptide encoded by the reference 
nucleotide sequence, e.g. where only changes in amino acids not affecting the polypeptide 
function occur. Desirably the substantially similar nucleotide sequence encodes the 
polypeptide encoded by the reference nucleotide sequence. The term "substantially similar" 
is specifically intended to include nucleotide sequences wherein the sequence has been 
modified to optimize expression in particular cells. The percentage of identity between the 
substantially similar nucleotide sequence and the reference nucleotide sequence desirably 
is at least 65%, more desirably at least 75%, preferably at least 85%, more preferably at 
least 90%, still more preferably at least 95%, yet still more preferably at least 99%. 
Sequence comparisons are carried out using a Smith-Waterman sequence alignment 
algorithm (see e.g. Waterman, M.S. Introduction to Computational Biology: Maps, 
sequences and genomes. Chapman & Hall. London: 1995. ISBN 0-412-99391-0, or at 
http://www-hto.usc.edu/software/seqaln/index.html). The locals program, version 1.16, is 
used with the following parameters: match: 1, mismatch penalty: 0.33, open-gap penalty: 2, 
extended-gap penalty: 2. A nucleotide sequence "substantially similar" to a reference 
nucleotide sequence hybridizes to the reference nucleotide sequence in 7% sodium dodecyl 
sulfate (SDS), 0.5 M NaPO4, 1 mM EDTA at 50°C with washing in 2X SSC, 0.1% SDS at 50°C, 
more desirably in 7% sodium dodecyl sulfate (SDS), 0.5 M NaPO4, 1 mM EDTA at 50°C with 
washing in 1X SSC, 0.1% SDS at 50°C, more desirably still in 7% sodium dodecyl sulfate 
(SDS), 0.5 M NaPO4, 1 mM EDTA at 50°C with washing in 0.5X SSC, 0.1% SDS at 50°C, 
preferably in 7% sodium dodecyl sulfate (SDS), 0.5 M NaPO4, 1 mM EDTA at 50°C with 
washing in 0.1X SSC, 0.1% SDS at 50°C, more preferably in 7% sodium dodecyl sulfate (SDS), 
0.5 M NaPO4, 1 mM EDTA at 50°C with washing in 0.1X SSC, 0.1% SDS at 65°C. 
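
The sequence comparison described above can be sketched in a few lines of Python. This is only an illustration, not the locals program itself: the function names are hypothetical, the two gap parameters (open-gap 2, extended-gap 2) are assumed to reduce to a flat cost of 2 per gap position, and percent identity is assumed to be counted over the full length of the local alignment, gaps included. The locals program may use different conventions.

```python
from typing import Tuple


def smith_waterman(a: str, b: str, match: float = 1.0,
                   mismatch: float = 0.33, gap: float = 2.0) -> Tuple[float, str, str]:
    """Smith-Waterman local alignment with a linear gap cost (assumption).

    Returns (best score, aligned part of a, aligned part of b).
    """
    rows, cols = len(a) + 1, len(b) + 1
    score = [[0.0] * cols for _ in range(rows)]
    best, best_pos = 0.0, (0, 0)

    # Fill the scoring matrix; local alignment never drops below zero.
    for i in range(1, rows):
        for j in range(1, cols):
            diag = score[i - 1][j - 1] + (match if a[i - 1] == b[j - 1] else -mismatch)
            up = score[i - 1][j] - gap
            left = score[i][j - 1] - gap
            score[i][j] = max(0.0, diag, up, left)
            if score[i][j] > best:
                best, best_pos = score[i][j], (i, j)

    # Trace back from the best cell until a zero-score cell is reached.
    i, j = best_pos
    out_a, out_b = [], []
    while i > 0 and j > 0 and score[i][j] > 0:
        s = score[i][j]
        diag = score[i - 1][j - 1] + (match if a[i - 1] == b[j - 1] else -mismatch)
        if abs(s - diag) < 1e-9:
            out_a.append(a[i - 1])
            out_b.append(b[j - 1])
            i, j = i - 1, j - 1
        elif abs(s - (score[i - 1][j] - gap)) < 1e-9:
            out_a.append(a[i - 1])
            out_b.append("-")
            i -= 1
        else:
            out_a.append("-")
            out_b.append(b[j - 1])
            j -= 1
    return best, "".join(reversed(out_a)), "".join(reversed(out_b))


def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Identical positions as a percentage of the alignment length (gaps included)."""
    if not aligned_a:
        return 0.0
    same = sum(x == y and x != "-" for x, y in zip(aligned_a, aligned_b))
    return 100.0 * same / len(aligned_a)
```

A candidate sequence could then be compared against the identity tiers given above (65%, 75%, 85%, 90%, 95%, 99%) by calling `percent_identity` on the two aligned strings returned by `smith_waterman`.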

The term "substantially similar", when used herein with respect to a protein, means a 
protein corresponding to a reference protein, wherein the protein has substantially the same 
structure and function as the reference protein, e.g. where only changes in amino acid 
sequence not affecting the polypeptide function occur. When used for a protein or an amino 
acid sequence the percentage of identity between the substantially similar and the 
reference protein or amino acid sequence desirably is at least 52%, more desirably 65%, 
more desirably at least 75%, preferably at least 85%, more preferably at least 90%, still 
more preferably at least 95%, yet still more preferably at least 99%. 

Substrate: a substrate is the molecule that the enzyme naturally recognizes and 
converts to a product in the biochemical pathway in which the enzyme naturally carries out 
its function, or is a modified version of the molecule, which is also recognized by the 
enzyme and is converted by the enzyme to a product in an enzymatic reaction similar to the 
naturally-occurring reaction. 

Tolerance : the ability to continue normal growth or function when exposed to an 
inhibitor or herbicide in an amount sufficient to suppress the normal growth or function of 
native, unmodified plants. 

Transformation: a process for introducing heterologous DNA into a cell, tissue, or 
plant. Transformed cells, tissues, or plants are understood to encompass not only the end 
product of a transformation process, but also transgenic progeny thereof. 

Transgenic: stably transformed with a recombinant DNA molecule that preferably 
comprises a suitable promoter operatively linked to a DNA sequence of interest. 

BRIEF DESCRIPTION OF THE SEQUENCES IN THE SEQUENCE LISTING 

SEQ ID NO:1 comprises an AG001 coding region. 

SEQ ID NO:2 comprises an amino acid sequence encoded by the coding region of SEQ ID 
NO:1. 

SEQ ID NO:3 comprises an AG002 coding region. 

SEQ ID NO:4 comprises an amino acid sequence encoded by the coding region of SEQ ID 
NO:3. 

SEQ ID NO:5 comprises an AG003 coding region. 

SEQ ID NO:6 comprises an amino acid sequence encoded by the coding region of SEQ ID 
NO:5. 

SEQ ID NO:7 comprises an AG004 coding region. 

SEQ ID NO:8 comprises an amino acid sequence encoded by the coding region of SEQ ID 
NO:7. 

SEQ ID NO:9 comprises an AG005 coding region. 

SEQ ID NO:10 comprises an amino acid sequence encoded by the coding region of SEQ ID 
NO:9. 

SEQ ID NO:11 comprises an AG006 coding region. 

SEQ ID NO:12 comprises an amino acid sequence encoded by the coding region of SEQ ID 
NO:11. 

It is an object of the invention to provide an effective and beneficial method to 
identify novel pesticides, particularly fungicides. A feature of the invention is the 
identification of genes having a putative activity based on their homology to yeast genes. 
Genes of the invention comprise putative GTP-binding protein genes (herein referred to 
as the AG001 and AG002 genes), putative GTPase-activating protein genes (AG003 and 
AG004), a putative phosphatidylinositol-4 kinase protein gene (AG005) and a putative 
cytokinesis gene (AG006). Another feature of the invention is the discovery that the genes 
of the invention, AG001 (SEQ ID NO: 1), AG002 (SEQ ID NO: 3), AG003 (SEQ ID NO: 5), 
AG004 (SEQ ID NO: 7), AG005 (SEQ ID NO: 9) and AG006 (SEQ ID NO: 11), are 
essential for fungal growth and development. An advantage of the present invention is that 
the newly discovered essential genes, which provide a novel fungicidal mode of action, enable 
one skilled in the art to easily and rapidly identify novel fungicides. 

One object of the present invention is to provide essential genes in fungi for assay 
development to detect inhibitory compounds with pesticidal, particularly fungicidal activity. 
Genetic results show that when AG001, AG002, AG003, AG004, AG005 and AG006 are 
mutated in Ashbya gossypii, the resulting phenotype is at best suppressed growth and at 
worst lethal. Suppressed growth, as used herein, means a growth rate of half the growth 
rate observed in wild type or lower, including cases where 10% of the wild-type growth rate 
was observed or where no growth was macroscopically detected at all. Applicants further 
observed abnormal filament development when AG001, AG002, AG003, AG004, AG005 and 
AG006 are mutated in Ashbya gossypii. This suggests a critical role for the 
gene products encoded by the mutated genes. 

The inventors of the present invention have demonstrated that the gene products of 
the invention are essential in Ashbya gossypii. This implies that chemicals which inhibit the 
function of the protein in fungi, particularly, filamentous fungi, are likely to have detrimental 
effects on fungi and are potentially good fungicide candidates. The present invention 
therefore provides methods of using a purified protein encoded by the gene sequence 
described below to identify inhibitors thereof, which can then be used as fungicides to 
suppress the growth of pathogenic fungi. 

Pathogenic fungi are defined as those capable of colonizing a host and causing 
disease. Examples of fungal pathogens include plant pathogens such as Septoria tritici, 
Stagonospora nodorum, Botrytis cinerea, Fusarium graminearum, Magnaporthe grisea, 
Cochliobolus heterostrophus, Colletotrichum heterostrophus, Ustilago maydis, Erysiphe 
graminis, plant pathogenic oomycetes such as Pythium ultimum and Phytophthora 
infestans, and human pathogens such as Candida albicans and Aspergillus fumigatus. 

The present invention discloses novel nucleotide sequences derived from Ashbya 
gossypii designated as the AG001 gene, the AG002 gene, the AG003 gene, the AG004 
gene, the AG005 gene and the AG006 gene. The nucleotide sequences of the genomic 
clones are set forth in SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID NO: 7, SEQ 
ID NO: 9 and SEQ ID NO: 11 respectively. The amino acid sequences encoded by the 
above sequences are set forth in SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 
8, SEQ ID NO: 10 or SEQ ID NO: 12. The present invention also includes nucleotide 
sequences substantially similar to those set forth in SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID 
NO: 5, SEQ ID NO: 7, SEQ ID NO: 9 or SEQ ID NO: 11 and amino acid sequences 
substantially similar to those set out in SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID 
NO: 8, SEQ ID NO: 10 or SEQ ID NO: 12. 

The present invention also encompasses fungal proteins whose amino acid sequences are 
substantially similar to the amino acid sequences set forth in SEQ ID NO: 2, SEQ ID NO: 4, 
SEQ ID NO: 6, SEQ ID NO: 8, SEQ ID NO: 10 or SEQ ID NO: 12. Encompassed by the 
present invention is a nucleotide sequence having a 20 base pair nucleotide portion 
identical in sequence to a 20 consecutive base pair portion of a sequence selected from 
the group consisting of SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID NO: 7, SEQ 
ID NO: 9 or SEQ ID NO: 11. Preferred is a nucleotide sequence having a base pair 
nucleotide portion identical in sequence to an 18 consecutive base pair portion of the 
sequence set forth in SEQ ID NO: 1. Preferred is a nucleotide sequence having a base pair 
nucleotide portion identical in sequence to a consecutive 9 base pair portion of the sequence 
set forth in SEQ ID NO: 3. Preferred is a nucleotide sequence having a base pair nucleotide 
portion identical in sequence to a 15 consecutive base pair portion of the sequence set forth 
in SEQ ID NO: 5. Preferred is a nucleotide sequence having a base pair nucleotide portion 
identical in sequence to a 14 consecutive base pair portion of the sequence set forth in 
SEQ ID NO: 7. Preferred is a nucleotide sequence having a base pair nucleotide portion 
identical in sequence to a 12 consecutive base pair portion of the sequence set forth in 
SEQ ID NO: 9. Preferred is a nucleotide sequence having a base pair nucleotide portion 
identical in sequence to a consecutive 10 base pair portion of the sequence set forth in 
SEQ ID NO: 11. 

In a particular embodiment, the present invention encompasses nucleic acid 
sequences and amino acid sequences of filamentous fungi. Preferred is a nucleotide 
sequence wherein the fungus is Ashbya gossypii. 
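
The shared-portion criterion described above (for example, a 20 base pair portion identical to 20 consecutive base pairs of a reference sequence) amounts to a substring test. The following is a minimal sketch under stated assumptions: the function name is hypothetical, only the strand as written is compared (the reverse complement is not checked), and the sequences used in any call are placeholders rather than sequences from the Sequence Listing.

```python
def shares_consecutive_portion(candidate: str, reference: str, n: int = 20) -> bool:
    """Return True if candidate and reference share an identical run of n bases.

    Assumption: comparison is case-insensitive and single-stranded.
    """
    candidate, reference = candidate.upper(), reference.upper()
    if n <= 0 or len(candidate) < n or len(reference) < n:
        return False
    # Index every n-base window of the reference, then scan the candidate.
    reference_windows = {reference[i:i + n] for i in range(len(reference) - n + 1)}
    return any(candidate[i:i + n] in reference_windows
               for i in range(len(candidate) - n + 1))
```

The window length `n` would be set per reference sequence according to the values given in the text (20, 18, 9, 15, 14, 12 or 10).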

Further encompassed by the invention is a chimeric gene comprising a promoter operably 
linked to a nucleotide sequence according to the invention. Preferred is a chimeric gene 
wherein the promoter is an inducible promoter. A further embodiment of the invention is a 
recombinant vector comprising a chimeric gene according to the invention wherein said 
vector is capable of being stably transformed into a host cell. 

Also included in the invention is a host cell comprising the vector according to the invention, 
wherein the nucleotide sequence is expressible in the host cell. Preferred is a host cell, 
wherein the host cell is eukaryotic. Preferred is a host cell, wherein the host cell is selected 
from the group consisting of a yeast cell and a fungal cell. More preferred is a host cell, 
wherein the host cell is a filamentous fungal cell. Particularly preferred is a host cell 
according to claim 24, wherein the host cell is an Ashbya gossypii cell. 
Preferred is a host cell wherein the host cell is a prokaryotic cell. More preferred is a host 
cell wherein the host cell is a bacterial cell. 

The present invention also includes methods of using the AG001 to AG006 gene 
products as fungicide targets, based on the essentiality of the genes for normal growth and 
development. Normal growth and development is defined as a growth rate substantially 
similar to that observed in wild type fungus, preferably greater than at least 50% the growth 
rate observed in wild type fungus and particularly greater than 10% the growth rate 
observed in wild type fungus. Normal growth and development may also be defined, when 
used in relation to filamentous fungi, as normal filament development (including normal 
septation and normal nuclear migration and distribution), normal sporulation, and normal 
production of any infection structures (e.g. appressoria). Conversely, suppressed or inhibited 
growth as used herein is defined as less than half the growth rate observed in wild type or 
lower, where 10% that of the wild-type growth rate was observed or no growth was 
macroscopically detected at all, or abnormal filament development. 
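
The growth definitions above can be expressed as a simple threshold rule. The sketch below is illustrative only: the function name and the returned labels are hypothetical, the 50% cutoff is taken from the text, and a rate of exactly half the wild-type rate is assumed to count as suppressed.

```python
def classify_growth(rate: float, wild_type_rate: float,
                    abnormal_filaments: bool = False, cutoff: float = 0.5) -> str:
    """Label a strain as 'normal' or 'suppressed' relative to wild type.

    Assumption: growth at or below the cutoff fraction of the wild-type
    rate, or any abnormal filament development, counts as suppressed.
    """
    if wild_type_rate <= 0:
        raise ValueError("wild-type growth rate must be positive")
    relative = rate / wild_type_rate
    if abnormal_filaments or relative <= cutoff:
        return "suppressed"
    return "normal"
```

Growth rates may be given in any unit (for example, radial colony extension per day) as long as the mutant and wild-type values use the same one.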

Furthermore, the invention can be used in screening assays to identify inhibitors 
that are potential pesticides, particularly fungicides. Encompassed by the present invention 
is the use of sequences selected from the attached Sequence Listing to identify substances 
having antifungal activity; the use of sequences selected from the attached Sequence 
Listing to identify substances having pesticidal, particularly fungicidal, activity. 

Further comprised is the use of a DNA sequence selected from the Sequence 
Listing and variants thereof in a screening method for identifying compounds capable of 
inducing broad spectrum disease resistance in plants. 

Encompassed by the invention is a process for identifying compounds having fungicidal 
activity comprising the steps of: 

a) combining a protein according to claim 16 and a compound to be tested for the 
ability to bind to the protein, under conditions conducive to binding; 

b) selecting a compound identified by step a) that is capable of binding the protein; 

c) applying the identified compound from step b) to a fungus to test for fungicidal 
activity; and 

d) selecting compounds having fungicidal activity. 

Encompassed by the present invention is a process for identifying an inhibitor of a protein 
activity having an amino acid sequence according to claim 16 comprising: 

a) introducing SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID NO: 7, SEQ ID 
NO: 9 or SEQ ID NO: 11 or nucleotide sequences substantially similar thereto into a host, 
such that the sequence is functionally expressible; 

b) combining the host cell of step a) with a compound to be tested for ability to inhibit 
the protein activity; 

c) overexpressing the nucleotide sequence of step a); 

d) measuring the host cell growth in step c); and 

e) selecting the compound that inhibits or suppresses the host cell's normal growth or 
development in step d). 

Further encompassed is a compound having fungicidal activity which compound can be 
identified by a process for identifying compounds having fungicidal activity according to the 
invention. 

Further encompassed is a method of suppressing growth of a fungus comprising applying 
to the fungus a compound that inhibits the activity of a protein comprising the amino acid 
sequence according to the invention in an amount sufficient to suppress the growth of the 
fungus. 

In a further embodiment according to the invention, a DNA sequence selected from 
the Sequence Listing may also be used for distinguishing among different species of plant 
pathogenic fungi and for distinguishing fungal pathogens from other pathogens such as 
bacteria. 

In another preferred embodiment, the present invention describes a method for 
identifying chemicals having the ability to inhibit any one or more of AG001, AG002, 
AG003, AG004, AG005 and AG006 activity in fungi, preferably comprising the steps of: a) 
obtaining a transgenic fungus and/or fungal cell, preferably stably transformed, comprising a 
non-native nucleotide sequence or an endogenous nucleotide sequence operably linked 
to a non-native promoter, preferably an inducible promoter, encoding an enzyme having an 
activity and capable of overexpressing an enzymatically active AG001, AG002, AG003, 
AG004, AG005 or AG006 gene product, where overexpression of the gene product 
suppresses or inhibits the normal growth and development of the fungus; b) applying a 
compound to the transgenic fungus and/or fungal cell; c) determining the growth and/or 
development of the transgenic fungus and/or fungal cell after application of the compound; 
d) comparing the growth and/or development of the transgenic fungus and/or fungal cell 
after application of the compound to the growth and/or development of the corresponding 
transgenic fungus and/or fungal cell to which the compound was not applied; and e) 
selecting a compound that does not result in reduction of the suppressed or inhibited growth 
and/or development in the transgenic fungus and/or fungal cell in comparison to the 
untreated transgenic fungus and/or fungal cell. 

In a preferred embodiment, the proteins having AG001, AG002, AG003, AG004, 
AG005 or AG006 activities are encoded by nucleotide sequences derived from fungi, 
preferably filamentous fungi, particularly from Ashbya gossypii, desirably identical or 
substantially similar to the nucleotide sequence set forth in SEQ ID NO: 1, SEQ ID NO: 3, 
SEQ ID NO: 5, SEQ ID NO: 7, SEQ ID NO: 9 or SEQ ID NO: 11. In another embodiment, the 
proteins having AG001, AG002, AG003, AG004, AG005 or AG006 activity are encoded by 
nucleotide sequences capable of encoding the amino acid sequences of SEQ ID NO: 2, 
SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 8, SEQ ID NO: 10 or SEQ ID NO: 12. In yet 
another embodiment, the proteins having AG001, AG002, AG003, AG004, AG005 or 
AG006 activity have amino acid sequences identical or substantially similar to the amino 
acid sequence set forth in SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 8, SEQ 
ID NO: 10 or SEQ ID NO: 12 respectively. 

The invention also provides a method for suppressing the growth of a fungus 
comprising the step of applying to the fungus a compound that inhibits the naturally 
occurring AG001, AG002, AG003, AG004, AG005 and/or AG006 activity in the fungus. 

Other objects and advantages of the present invention will become apparent to those 
skilled in the art from a study of the following description of the invention and non-limiting 
examples. 

Essentiality of the AG001, AG002, AG003, AG004, AG005 and AG006 Genes in Ashbya 
gossypii Demonstrated by Gene Disruption 

Owing to the provision within the scope of this invention of a novel and powerful 
gene disruption process, there is no longer a need to know the exact biological function of 
the protein product encoded by a gene comprising one of the A. gossypii DNA sequences 
provided herein. 

As shown in the examples below, the identification of novel gene structures, as well as the 
essentiality of the AG001, AG002, AG003, AG004, AG005 and AG006 genes for normal 
fungal growth and development, have been demonstrated for the first time in Ashbya 
gossypii using gene disruption techniques. Having established the essentiality of AG001, 
AG002, AG003, AG004, AG005 and AG006 function in fungi and having identified the 
nucleic acid sequences encoding these essential activities, the inventors thereby provide an 
important and sought-after tool for new pesticide, particularly fungicide, development. 

Recombinant Production of and Uses Thereof 

For recombinant production of AG001, AG002, AG003, AG004, AG005 and AG006 in a 
host organism, a nucleotide sequence encoding AG001, AG002, AG003, AG004, AG005 or 
AG006 protein is inserted into an expression cassette designed for the chosen host and 
introduced into the host where it is recombinantly produced. The choice of specific 
regulatory sequences such as promoter, signal sequence, 5' and 3' untranslated 
sequences, and enhancer appropriate for the chosen host is within the level of skill of the 
routineer in the art. The resultant molecule, containing the individual elements operably 
linked in proper reading frame, may be inserted into a vector capable of being transformed 
into the host cell. Suitable expression vectors and methods for recombinant production of 
proteins are well known for host organisms such as E. coli, yeast, and insect cells (see, 
e.g., Luckow and Summers, Bio/Technol. 6: 47 (1988), and baculovirus expression vectors, 
e.g., those derived from the genome of Autographa californica nuclear polyhedrosis virus 
(AcMNPV)). A preferred baculovirus/insect system is pAcHLT (Pharmingen, San Diego, CA) 
used to transfect Spodoptera frugiperda Sf9 cells (ATCC) in the presence of linear 
Autographa californica baculovirus DNA (Pharmingen, San Diego, CA). The resulting virus is 
used to infect High Five Trichoplusia ni cells (Invitrogen, La Jolla, CA). Further preferred 
expression systems are commercially available, such as Baculovirus expression systems: 
MaxBac 2.0 kit; Invitrogen, Carlsbad, CA; BacPAK Baculovirus Expression System; 
CLONTECH, Palo Alto, CA; for Yeast expression vectors: pYEUra3; CLONTECH, Palo 
Alto, CA; EasySelect Pichia expression kit; Invitrogen, Carlsbad, CA; ESP Yeast Protein 
Expression and Purification System; Stratagene, La Jolla, CA; E. coli expression vectors: 
pKK233-2; CLONTECH, Palo Alto, CA; pET3 series vectors; Stratagene, La Jolla, CA. 

In a preferred embodiment, the nucleotide sequence encoding a protein having 
AG001, AG002, AG003, AG004, AG005 or AG006 activity is derived from a eukaryote, 
such as a mammal, a fly or a yeast, but is preferably derived from a fungus, particularly a 
filamentous fungus. In a further preferred embodiment, the nucleotide sequence is identical 
or substantially similar to the nucleotide sequence set forth in SEQ ID NO: 1, SEQ ID NO: 3, 
SEQ ID NO: 5, SEQ ID NO: 7, SEQ ID NO: 9 or SEQ ID NO: 11, or encodes a protein 
having AG001, AG002, AG003, AG004, AG005 or AG006 activity, whose amino acid 
sequence is identical or substantially similar to the amino acid sequence set forth in SEQ ID 
NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 8, SEQ ID NO: 10 or SEQ ID NO: 12 
respectively. The nucleotide sequences set forth in SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID 
NO: 5, SEQ ID NO: 7, SEQ ID NO: 9 or SEQ ID NO: 11 encode the proteins comprising the 
amino acid sequences set forth in SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID 
NO: 8, SEQ ID NO: 10 or SEQ ID NO: 12. In another preferred embodiment, the 
nucleotide sequence is derived from a prokaryote, preferably a bacterium. 

Recombinantly produced AG001, AG002, AG003, AG004, AG005, or AG006 is 
isolated and purified using a variety of standard techniques. The actual techniques that 
may be used will vary depending upon the host organism used, whether the protein is 
designed for secretion, and other such factors familiar to the skilled artisan (see, e.g. 
chapter 16 of Ausubel, F. et al., "Current Protocols in Molecular Biology", pub. by John 
Wiley & Sons, Inc. (1994)). 

Assays for Characterizing the AG001, AG002, AG003, AG004, AG005 and AG006 Proteins 

Recombinantly produced AG001, AG002, AG003, AG004, AG005 and AG006 
proteins are useful for a variety of purposes. For example, they can be used in in vitro 
assays to screen known pesticidal, particularly fungicidal, chemicals whose target has not 
been identified to determine if they inhibit AG001, AG002, AG003, AG004, AG005 or 
AG006. Such in vitro assays may also be used as more general screens to identify 
chemicals that inhibit such enzymatic activities and that are therefore novel pesticide, 
particularly fungicide, candidates. Alternatively, recombinantly produced AG001, AG002, 
AG003, AG004, AG005 or AG006 proteins may be used to elucidate the complex structure 
of these molecules and to further characterize their association with known inhibitors in 
order to rationally design new inhibitory pesticides, particularly fungicides. Nucleotide 
sequences substantially similar to SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID 
NO: 7, SEQ ID NO: 9 or SEQ ID NO: 11 and proteins substantially similar to SEQ ID NO: 
2, SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 8, SEQ ID NO: 10 or SEQ ID NO: 12 from 
any source, including microbial sources, can be used in the assays exemplified herein. 
Desirably such nucleotide sequences and proteins are derived from fungi. More desirably, 
they are derived from filamentous fungi, particularly Ashbya gossypii. Alternatively, such 
nucleotide sequences and proteins are derived from non-yeast sources, alternatively from 
non-Saccharomyces cerevisiae sources. 

A simple assay can be developed to screen for compounds that affect normal 
functioning of the fungal-encoded activity. Such compounds are promising in vitro leads 
that can be tested for in vivo pesticidal, particularly fungicidal, activity. A nucleic acid 
sequence of the invention according to any one of the sequences SEQ ID NO: 1, SEQ ID 
NO: 3, SEQ ID NO: 5, SEQ ID NO: 7, SEQ ID NO: 9 or SEQ ID NO: 11 may be operably 
linked to a strong inducible promoter, such promoters being known in the art. The vector 
comprising the selected gene of the invention operably linked to the selected inducible 
promoter may be transformed into bacteria, such as E. coli. Transformed E. coli harboring 
and functionally overexpressing an AG001, AG002, AG003, AG004, AG005 or 
AG006 gene may be grown in a 96-well format for automated high-throughput screening 
where inducible overexpression of the selected gene is lethal or suppresses growth of the 
host. Compounds that are effective in blocking function of the AG001, AG002, AG003, 
AG004, AG005 or AG006 protein result in bacterial growth. This growth is measured by 
simple turbidimetric means. 
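
The read-out logic of this growth-rescue screen can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the well names, the optical density values, the two sets of control wells and the 50% rescue cut-off are all hypothetical and are not taken from the specification. A well is flagged when the test compound restores growth of the induced (growth-suppressed) culture toward the level of an uninduced culture.

```python
from statistics import mean


def rescue_hits(od_by_well, induced_controls, uninduced_controls, min_rescue=0.5):
    """Flag wells in which a compound restores growth of an induced culture.

    od_by_well         : dict mapping well name to turbidity (e.g. OD600)
    induced_controls   : turbidity of induced cultures without compound (suppressed growth)
    uninduced_controls : turbidity of uninduced cultures (normal growth)
    min_rescue         : hypothetical cut-off, as a fraction of the control window
    """
    low = mean(induced_controls)
    high = mean(uninduced_controls)
    if high <= low:
        raise ValueError("induction does not suppress growth; assay window is empty")
    hits = {}
    for well, od in od_by_well.items():
        # 0.0 means no rescue, 1.0 means growth equal to the uninduced control.
        rescue = (od - low) / (high - low)
        if rescue >= min_rescue:
            hits[well] = round(rescue, 2)
    return hits


if __name__ == "__main__":
    plate = {"A1": 0.10, "A2": 0.60, "A3": 0.20}  # hypothetical readings
    print(rescue_hits(plate, induced_controls=[0.08, 0.12],
                      uninduced_controls=[0.58, 0.62]))
```

Normalising against both controls, rather than using a fixed turbidity threshold, keeps the cut-off comparable between plates whose absolute readings drift.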

In another embodiment, an assay for inhibitors of the AG001, AG002, AG003, 
AG004, AG005 or AG006 activities uses transgenic fungi or fungal cells capable of 
overexpressing a nucleotide sequence having AG001, AG002, AG003, AG004, AG005 or 
AG006 activity, respectively, operably linked to a strong inducible promoter, wherein 
the selected gene product is enzymatically active in the transgenic fungi and/or fungal cells 
and inducible overexpression of the gene inhibits and/or suppresses growth and/or 
development of the fungus. The nucleotide sequence is preferably derived from a 
eukaryote, such as a yeast, more preferably from a fungus, and more particularly 
from a filamentous fungus. In a further preferred embodiment, the nucleic acid sequences 
set forth in SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID NO: 7, SEQ ID NO: 9 or 
SEQ ID NO: 11 encode enzymes having AG001, AG002, AG003, AG004, AG005 or 
AG006 activity, respectively, whose amino acid sequence is identical or substantially similar 
to the amino acid sequence set forth in SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, SEQ 
ID NO: 8, SEQ ID NO: 10 or SEQ ID NO: 12. The transgenic fungus or fungal cells are 
grown in 96-well format microtiter dishes for high-throughput screening. Compounds that 
are effective in blocking function of the AG001, AG002, AG003, AG004, AG005 or AG006 
protein result in fungal growth. This growth is measured by methods known in the art. In a 
particular embodiment the transgenic fungus is Ashbya gossypii. 

Similar assays based on expression of the fungal genes of the invention in yeast, 
using appropriate expression systems as described above, may also be used. 

In Vitro Inhibitor Assays: Discovery of Small Molecule Ligand that Interacts with Protein of 
Unknown Function 

Novel technologies are being examined that can detect interactions between a 
protein and a ligand without knowing the biological function of the protein. A short 
description of three methods is presented: fluorescence correlation spectroscopy, 
surface-enhanced laser desorption/ionization, and Biacore technologies. Many more of 
these methods are currently being discovered, and some may be amenable to automated, 
large scale screening in light of this disclosure. 



WO 00/22133 

PCT/EP99/07501 



Fluorescence Correlation Spectroscopy (FCS) theory was developed in 1972, but it is 
only in recent years that the technology to perform FCS became available (Magde et al. 
(1972) Phys. Rev. Lett., 29: 705-708; Maiti et al. (1997) Proc. Natl. Acad. Sci. USA, 94: 
11753-11757). FCS measures the average diffusion rate of a fluorescent molecule within a 
small sample volume. The sample size can be as low as 10^3 fluorescent molecules and the 
sample volume as small as the cytoplasm of a single bacterium. The diffusion rate is a 
function of the mass of the molecule and decreases as the mass increases. FCS can 
therefore be applied to protein-ligand interaction analysis by measuring the change in mass 
and therefore in diffusion rate of a molecule upon binding. 
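
The size of the diffusion change that FCS has to resolve can be estimated with a short calculation. The sketch below rests on an assumption that is not part of the specification: the molecule is treated as a compact sphere of constant density obeying the Stokes-Einstein relation, so that the diffusion coefficient scales with mass as M^(-1/3). The masses used are hypothetical.

```python
def diffusion_ratio(mass_before_kda: float, mass_after_kda: float) -> float:
    """Return D_after / D_before for a compact sphere, where D scales as M**(-1/3)."""
    if mass_before_kda <= 0 or mass_after_kda <= 0:
        raise ValueError("masses must be positive")
    return (mass_before_kda / mass_after_kda) ** (1.0 / 3.0)


# Hypothetical masses chosen only for illustration.
LIGAND_KDA = 0.5
PROTEIN_KDA = 50.0

if __name__ == "__main__":
    # Fluorescent label on the small ligand: its mass changes about 100-fold on binding.
    labelled_ligand = diffusion_ratio(LIGAND_KDA, LIGAND_KDA + PROTEIN_KDA)
    # Fluorescent label on the protein: its mass changes by only about 1% on binding.
    labelled_protein = diffusion_ratio(PROTEIN_KDA, PROTEIN_KDA + LIGAND_KDA)
    print(f"labelled ligand:  D_bound/D_free = {labelled_ligand:.2f}")
    print(f"labelled protein: D_bound/D_free = {labelled_protein:.3f}")
```

Under this simplified model the fluorescent label is better placed on the smaller binding partner, because only then does binding produce a large relative change in mass and hence in diffusion rate.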

Surface-Enhanced Laser Desorption/Ionization (SELDI) was invented by Hutchens 
and Yip during the late 1980's (Hutchens and Yip (1993) Rapid Commun. Mass Spectrom. 
7: 576-580). When coupled to a time-of-flight mass spectrometer (TOF), SELDI provides a 
means to rapidly analyze molecules retained on a chip. It can be applied to ligand-protein 
interaction analysis by covalently binding the target protein on the chip and analyzing by MS 
the small molecules retained by this protein (Worrell et al. (1998) Anal. Biochem. 70: 
750-756). 

Biacore relies on changes in the refractive index at the surface layer upon binding of a 
ligand to a protein immobilized on the layer. In this system, a collection of small ligands is 
injected sequentially in a 2-5 µl cell with the immobilized protein. Binding is detected by 
surface plasmon resonance (SPR) by recording laser light refracting from the surface. In 
general, the refractive index change for a given change of mass concentration at the 
surface layer is practically the same for all proteins and peptides, allowing a single method 
to be applicable for any protein (Liedberg et al. (1983) Sensors Actuators 4: 299-304; 
Malmquist (1993) Nature 361: 186-187). 

IV. In Vivo Inhibitor Assay 

In one embodiment, a suspected pesticide, particularly fungicide, for example one 
identified by in vitro screening, is applied to fungi at various concentrations. After 
application of the suspected fungicide, its effect on the fungus, for example inhibition or 
suppression of growth and development, is recorded. 

The invention will be further described by reference to the following detailed 
examples. These examples are provided for purposes of illustration only, and are not 
intended to be limiting unless otherwise specified. 

EXAMPLES 

Standard recombinant DNA and molecular cloning techniques used here are well known in 
the art and are described by Sambrook et al., Molecular Cloning, Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor, NY (1989) and by T.J. Silhavy, M.L. Berman, and 
L.W. Enquist, Experiments with Gene Fusions, Cold Spring Harbor Laboratory, Cold Spring 
Harbor, NY (1984) and by Ausubel, F.M. et al., Current Protocols in Molecular Biology, pub. 
by Greene Publishing Assoc. and Wiley-Interscience (1987). 

Construction and characterization of a genomic library of A. gossypii (strain ATCC10895) 
and identification of ORFs and promoters are described in U.S. Patent Application Ser. No. 
08/998,416, which is hereby incorporated by reference in its entirety. 

Example 1: Identification of Antifungal Drug Targets Represented in the Sequence Listing 

Gene disruptions of Ashbya gossypii genes are generated by a method using short flanking 
homology regions to produce gene targeting events. The short flanking homology regions 
are included within polymerase chain reaction primers with an overall length of 65 
nucleotides. Each of these 65-mers contains approximately 45 nucleotides of homology to the 
target gene locus (the target gene locus being identified as described in U.S. Patent 
Application Ser. No. 08/998,416, incorporated above by reference) and 20 nucleotides of 
invariant homology to a geneticin resistance gene module (also described in U.S. Patent 
Application Ser. No. 08/998,416, previously incorporated by reference), with one primer 
(designated S1) anchored to the 5' end of the geneticin resistance module (using the 
invariant sequence 5'-GCTAGGGATAACAGGGTAAT-3') (SEQ ID NO:13) and the other 
primer of the pair (designated S2) anchored to the 3' end of the geneticin resistance module 
(using the invariant sequence 5'-AGGCATGCAAGCTTAGATCT-3') (SEQ ID NO:14). The 
PCR product resulting from the amplification of the geneticin resistance module with such 
an S1/S2 primer pair thus consists of the module flanked by short flanking homology 
regions of ca. 45 nucleotides specific to the chosen gene disruption site. 
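
The primer layout described above (a gene-specific homology arm followed at the 3' end by the 20-nucleotide invariant sequence) can be sketched as follows. The two invariant sequences are those given in the text as SEQ ID NO:13 and SEQ ID NO:14; the function name, the validation rules and the example homology arm are hypothetical. Choosing the correct strand and orientation for each homology arm is left to the user and is not handled here.

```python
S1_INVARIANT = "GCTAGGGATAACAGGGTAAT"  # SEQ ID NO:13, anneals to the 5' end of the module
S2_INVARIANT = "AGGCATGCAAGCTTAGATCT"  # SEQ ID NO:14, anneals to the 3' end of the module


def build_primer(homology_arm: str, invariant: str) -> str:
    """Join a gene-specific homology arm (5' part) to an invariant module sequence (3' part)."""
    arm = homology_arm.upper()
    unexpected = set(arm) - set("ACGT")
    if unexpected:
        raise ValueError(f"non-nucleotide characters in homology arm: {sorted(unexpected)}")
    if len(invariant) != 20:
        raise ValueError("invariant sequence is expected to be 20 nucleotides long")
    return arm + invariant


if __name__ == "__main__":
    # Placeholder 45-nucleotide arm, not a real Ashbya gossypii sequence.
    example_arm = "ACGTA" * 9
    primer = build_primer(example_arm, S1_INVARIANT)
    print(len(primer), primer)
```

Rejecting characters other than A, C, G and T at this stage catches transcription errors in a primer before it is ordered.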

Once an S1/S2 primer pair is designed for a particular gene target, approximately 10 
µg of the desired geneticin resistance module is obtained by linearizing a vector containing 
the geneticin resistance gene positioned behind an appropriate fungal promoter (for 
example, the Saccharomyces cerevisiae TEF1 promoter) and subjecting the linearized 
template to approximately 35 rounds of a PCR reaction consisting of the following steps: 
Step 1: Denaturation at 96 °C for 30 seconds; Step 2: Primer annealing at 50 °C for 30 
seconds; Step 3: Elongation reaction at 72 °C for 2.5 minutes. Following the 35th round of 
this protocol, a final elongation period of 5 minutes at 72 °C is carried out. 
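
For planning purposes, the cycling program above can be written down as data and its total hold time added up. This is a minimal sketch: the variable and function names are hypothetical, and the result excludes temperature ramping and any initial denaturation step, neither of which is specified in the text.

```python
# (step name, temperature in degrees C, hold time in seconds), as stated in the protocol
CYCLE_STEPS = [
    ("denaturation", 96, 30),
    ("primer annealing", 50, 30),
    ("elongation", 72, 150),
]
CYCLES = 35
FINAL_ELONGATION_S = 300  # 5 minutes at 72 degrees C after the last cycle


def total_hold_seconds(steps=CYCLE_STEPS, cycles=CYCLES, final_s=FINAL_ELONGATION_S):
    """Sum the programmed hold times; ramp times between temperatures are not included."""
    per_cycle = sum(seconds for _name, _temp, seconds in steps)
    return cycles * per_cycle + final_s


if __name__ == "__main__":
    seconds = total_hold_seconds()
    print(f"{seconds} s of programmed holds, about {seconds / 60:.1f} min")
```

Each cycle holds for 210 seconds, so 35 cycles plus the final elongation come to 7650 seconds, a little over two hours before ramp times are added.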

Transformation of the PCR product resulting from amplification with the S1/S2 primer 
pair is done by electroporation as follows: 

1) Inoculate 100 ml of AFM media (1% casein peptone, 2% glucose, 1% yeast extract, 
0.1% myo-inositol) with an Ashbya spore suspension of approximately 10^ spores. 
2) Incubate at 30 °C for a maximum of 18 hours at a shaker speed of 200 rpm. 
3) Collect the resultant fungal mycelia by filtration and wash once with sterile water. 
4) Resuspend 1 gram of mycelia (wet weight) in 40 ml of 50 mM potassium phosphate 
buffer, pH 7.5, containing 25 mM DTT and incubate at 30 °C for 30 minutes with gentle 
shaking. 
5) Collect the mycelia by filtration and wash once with 50 ml of cold STM buffer (275 mM 
sucrose, 10 mM Tris-HCl, pH 7.5, 2 mM MgCl2). 
6) Resuspend the mycelia to a dense mixture in STM buffer. 
7) Mix approximately 150 µl of the mycelial mixture with 10 µg of PCR product (in a 
maximum volume of 50 µl) in an Eppendorf tube and transfer the mixture to an 
electroporation cuvette with a 4 mm gap distance. 
8) Apply an electric field pulse of 1.5 kV, 100 ohms, 25 µF, which will result in a pulse 
length of approximately 2.3 milliseconds. Add 1 ml of AFM media to the cuvette and spread 
equal amounts onto 3 pre-dried AFM agar plates. 
9) Incubate plates for a minimum of 4 hours at 30 °C. 
10) Overlay the plates with 8 ml of a 0.5% agarose toplayer containing Geneticin/G418 at a 
final concentration of 200 µg/ml. 
11) Incubate at 30 °C for approximately 3 days to allow sufficient growth of geneticin 
resistant transformants. 
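
The electroporation settings in step 8 can be checked with a short calculation. The sketch below assumes a simple exponential-decay pulse whose nominal time constant is the product of the set resistance and capacitance, and it reads the cuvette gap in step 7 as 4 mm; both points are assumptions rather than statements from the specification, and the function names are hypothetical.

```python
def rc_time_constant_ms(resistance_ohm: float, capacitance_uf: float) -> float:
    """Nominal time constant tau = R * C of an exponential-decay pulse, in milliseconds."""
    return resistance_ohm * capacitance_uf * 1e-6 * 1e3


def field_strength_kv_per_cm(voltage_kv: float, gap_mm: float) -> float:
    """Nominal field strength across the cuvette gap, in kV/cm."""
    return voltage_kv / (gap_mm / 10.0)


if __name__ == "__main__":
    tau = rc_time_constant_ms(resistance_ohm=100, capacitance_uf=25)
    field = field_strength_kv_per_cm(voltage_kv=1.5, gap_mm=4)
    print(f"nominal time constant: {tau:.2f} ms")
    print(f"nominal field strength: {field:.2f} kV/cm")
```

The nominal time constant comes out at 2.5 ms, close to the pulse length of approximately 2.3 ms stated in step 8. A somewhat shorter measured pulse would be expected if the sample itself conducts and so lowers the effective resistance, although the specification does not say this.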

Verification of the desired transformation event resulting in homologous integration 
of the geneticin resistance module in the target of interest is achieved by PCR using 
verification primers designated G1 (positioned upstream of the S1 region) and G4 
(positioned downstream of the S2 region) and template DNA purified from putative Ashbya 
transformants. Additional verification primers designated G2 
(5'-GTTTAGTGTGAGGATGTGATCTG-3') (SEQ ID NO:15) and G3 
(5'-TCGCAGACCGATACCAGGATC-3') (SEQ ID NO:16) are derived from the open reading 
frame of the selectable geneticin resistance gene such that the detection of a G1/G2 PCR 
product and/or a G3/G4 PCR product of a predictable size serves to verify the desired gene 
disruption event. Also, verification of the desired gene disruption can be determined by 
standard DNA hybridization experiments. 

Determination of whether a gene is essential to growth of Ashbya can be achieved by 
the following analysis. The transformation of DNA fragments described above utilizes 
multinucleate Ashbya mycelia as recipients. Therefore a primary transformant able to grow 
on geneticin-containing media originates as a mycelium containing cells at least one of 
which has at least one transformed nucleus, but usually containing non-transformed nuclei 
as well. Thus, if an essential gene is disrupted in the transformed nucleus, the essential 
gene product can, in many instances, still be supplied by the non-transformed nuclei within 
the same cell. Such primary transformants usually exhibit normal growth and sporulation, 
and spores are collected from primary transformants allowed to grow at 30 °C for at least 5 
days. Since spores are uninucleate, however, transformants which have an essential gene 
disrupted in nuclei containing the geneticin resistance cartridge will fail to yield spores which 
grow normally, if at all, on geneticin-containing media. 

S1 and S2 primer pairs usable to generate disruptions of the indicated genes are as follows: 

AG001: S1: 5'-AGGACCAGTAGGTGGTTGCGGTGGAATATAATAATAAGAAGGAGA 
GCTAGGGATAACAGGGTAAT-3' (SEQ ID NO:17) 

S2: 5'-AAGTATTGAATCAACTATGTGAGTAGTTTCTTGTAGGGAGTCTCG 
AGGCATGCAAGCTTAGATCT-3' (SEQ ID NO:18) 

AG002: S1: 5'-GTGGGATGAGAGGAAGCTGGCACGAGCAAGCTGTAGAAAGAGAAG 
GCTAGGGATAACAGGGTAAT-3' (SEQ ID NO:19) 

S2: 5'-ATTATATTAGTATAGTCTAAAGTTGCAGQCAGTGGGTATTAAAGT 
AGGCATGCAAGCTTAGATCT-3' (SEQ ID NO:20) 

AG003: S1: 5'-AGTTGGGTAGTGTTTCGCGTQGTGGTCAGCCACCGAAGAACGCAG 
GCTAGGGATAACAGGGTAAT-3' (SEQ ID NO:21) 

S2: 5'-TTAAAGAATGATAAAGAAGGAAAAACACCAGGAGGTTGCATAACA 
AGGCATGCAAGCTTAGATCT-3' (SEQ ID NO:22) 

AG004: S1: 5'-GTGCGTGTCAGCGAGCATCTAATCAAGCTGCAAGGCGCCGGAAAT 
GCTAGGGATAACAGGGTAAT-3' (SEQ ID NO:23) 

S2: 5'-TTATCACATATTTCTAAGTTAATAGATATTTTTACTTAGTATGAA 
AGGCATGCAAGCTTAGATCT-3' (SEQ ID NO:24) 

AG006: S1: 5'-GAGAGAGACGCTACGGTACTACGAATTTCTCTGTAGAGTTGGAGA 
GCTAGGGATAACAGGGTAAT-3' (SEQ ID NO:25) 

S2: 5'-TACTATTGAGAATGTTCGCGACTGCATGTAAAGTCTCAAAAACTT 
AGGCATGCAAGCTTAGATCT-3' (SEQ ID NO:26) 

AG005: S1: 5'-AAATATAATAAAAATTGACAACTGGCTAGAAGTGATACGGCAGTT 
GCTAGGGATAACAGGGTAAT-3' (SEQ ID NO:27) 

S2: 5'-GCTCTTATAGTTCATGACGCATTCATATGGGTCATTCAGGTCTCT 
AGGCATGCAAGCTTAGATCT-3' (SEQ ID NO:28) 

The above disclosed embodiments are illustrative. This disclosure of the invention will 
place one skilled in the art in possession of many variations of the invention. All such 
obvious and foreseeable variations are intended to be encompassed by the appended 
claims. 

What is claimed is: 

1. A nucleotide sequence substantially similar to any one of SEQ ID NO: 1, SEQ ID NO: 
3, SEQ ID NO: 5, SEQ ID NO: 7, SEQ ID NO: 9 or SEQ ID NO: 11. 

2. The nucleotide sequence of claim 1, wherein the nucleotide sequence is SEQ ID NO: 
1, SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID NO: 7, SEQ ID NO: 9 or SEQ ID NO: 11. 

3. The nucleotide sequence of claim 1 or 2, wherein the nucleotide sequence is a fungal 
nucleotide sequence. 

4. The nucleotide sequence of claim 3, wherein the fungus is Ashbya gossypii. 

5. The nucleotide sequence of claim 1, wherein the nucleotide sequence encodes an 
amino acid sequence substantially similar to any one of SEQ ID NO: 2, SEQ ID NO: 4, SEQ 
ID NO: 6, SEQ ID NO: 8, SEQ ID NO: 10 or SEQ ID NO: 12, respectively. 

6. A nucleotide sequence encoding an amino acid sequence according to any one of 
SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 8, SEQ ID NO: 10 or SEQ ID 
NO: 12. 

7. A nucleotide sequence having a 20 base pair nucleotide portion identical in sequence 
to a 20 consecutive base pair portion of a sequence selected from the group consisting of 
SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID NO: 7, SEQ ID NO: 9 or SEQ ID 
NO: 11. 

8. An amino acid sequence comprising an amino acid sequence encoded by a nucleic 
acid sequence substantially similar to any one of SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID 
NO: 5, SEQ ID NO: 7, SEQ ID NO: 9 or SEQ ID NO: 11. 

9. An amino acid sequence substantially similar to any one of SEQ ID NO: 2, SEQ ID 
NO: 4, SEQ ID NO: 6, SEQ ID NO: 8, SEQ ID NO: 10 or SEQ ID NO: 12. 
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10. A chimeric gene comprising a promoter operably linked to a nucleotide sequence 
according to claim 1 or 2. 

11. The chimeric gene of claim 10, wherein the promoter is an inducible promoter. 

12. A recombinant vector comprising a chimeric gene according to claim 11, wherein said 
vector is capable of being stably transformed into a host cell. 

13. A host cell comprising the vector according to claim 12, wherein the nucleotide 
sequence is expressible in the host cell. 

14. The host cell according to claim 13, wherein the host cell is an Ashbya gossypii cell. 

15. A process for identifying compounds having fungicidal activity comprising the steps of: 

a) combining a protein according to claim 16 and a compound to be tested for the 
ability to bind to the protein, under conditions conducive to binding; 

b) selecting a compound identified by step a) that is capable of binding the protein; 

c) applying the identified compound from step b) to a fungus to test for fungicidal 
activity; and 

d) selecting compounds having fungicidal activity. 

16. A compound having fungicidal activity identifiable according to claim 15. 

17. A process for identifying an inhibitor of a protein activity having an amino acid 
sequence according to claim 9 comprising: 

a) introducing SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID NO: 7, SEQ ID 
NO: 9 or SEQ ID NO: 11 or nucleotide sequences substantially similar thereto into a host, 
such that the sequence is functionally expressible; 

b) combining the host cell of step a) with a compound to be tested for ability to inhibit 
the protein activity; 

c) overexpressing the nucleotide sequence of step a); 

c) measuring the host cell growth in step c); and 

d) selecting the compound that inhibits or suppresses the host cell's normal growth or 
development in step c). 

18. A compound having fungicidal activity identifiable according to the process of claim 
17. 

19. A method of suppressing growth of a fungus comprising applying to the fungus a 
compound that inhibits the activity of a protein comprising the amino acid sequence 
according to claim 9 in an amount sufficient to suppress the growth of the fungus. 

20. The method of claim 19, wherein the compound is selected from a group consisting of 
the compounds of claims 16 and 18. 

SEQUENCE LISTING 

<110> Gaffney, Thomas 
Wendland, Juergen 
Philippsen, Peter 

<120> Novel Fungal Genes Required For Normal Growth And 
Development 

<130> CGC2046 

<140> 
<141> 

<160> 28 

<170> PatentIn Ver. 2.0 

<210> 1 

<211> 624 

<212> DNA 

<213> Ashbya gossypii 

<220> 
<221> CDS 
<222> (1)..(624) 

<400> 1 

atg tct cag caa atg cat aac ccc agt atc agg aga aaa ttg gtg atc 48 
Met Ser Gln Gln Met His Asn Pro Ser Ile Arg Arg Lys Leu Val Ile 
1 5 10 15 

gtc gga gat ggt gca tgc ggg aaa aca tgt ctt ttg att gtg ttt gcc 96 
Val Gly Asp Gly Ala Cys Gly Lys Thr Cys Leu Leu Ile Val Phe Ala 
20 25 30 

aag gga aag ttc cca cag gtg tat gtt cct acg gtt ttc gac aac tac 144 
Lys Gly Lys Phe Pro Gln Val Tyr Val Pro Thr Val Phe Asp Asn Tyr 
35 40 45 

gtt gca gat gtg gag gta gac ggc aga cgg gtg gag ctt gcg ctt tgg 192 
Val Ala Asp Val Glu Val Asp Gly Arg Arg Val Glu Leu Ala Leu Trp 
50 55 60 

gat acg gct ggg cag gag gat tac gac agg cta cgg ccg tta tcg tac 240 
Asp Thr Ala Gly Gln Glu Asp Tyr Asp Arg Leu Arg Pro Leu Ser Tyr 
65 70 75 80 

cca gac tcc aat gtt gtg ttg atc tgc tac tcg att gac cta cca gac 288 
Pro Asp Ser Asn Val Val Leu Ile Cys Tyr Ser Ile Asp Leu Pro Asp 
85 90 95 

tcg ttg gag aac gtg atg gag aag tgg atc agc gag gtg cta tac ttc 336 
Ser Leu Glu Asn Val Met Glu Lys Trp Ile Ser Glu Val Leu Tyr Phe 
100 105 110 

tgc cag ggt gtt ccg atc atc ttg gtg ggg tgc aag gct gac ttg cgg 384 
Cys Gln Gly Val Pro Ile Ile Leu Val Gly Cys Lys Ala Asp Leu Arg 
115 120 125 

aac gat ccg caa gtg atc gag cag ttg aga cag cag gga cag cag cct 432 
Asn Asp Pro Gln Val Ile Glu Gln Leu Arg Gln Gln Gly Gln Gln Pro 
130 135 140 

gtc tcg cag gct cag gcg cag gag gta gcg gac cag atc ggc gcg gta 480 
Val Ser Gln Ala Gln Ala Gln Glu Val Ala Asp Gln Ile Gly Ala Val 
145 150 155 160 

gag tac att gag tgc tct gca aag acc ggc ttt ggt gtg cgc gag gtg 528 
Glu Tyr Ile Glu Cys Ser Ala Lys Thr Gly Phe Gly Val Arg Glu Val 
165 170 175 

ttt gag gcg gcc acg cgt gct tcc ttg atg ggg aaa caa ggc aag tct 576 
Phe Glu Ala Ala Thr Arg Ala Ser Leu Met Gly Lys Gln Gly Lys Ser 
180 185 190 

aag gcg aag tct gac aag aag aag aag aaa aag tgt gtg gtc ttg tag 624 
Lys Ala Lys Ser Asp Lys Lys Lys Lys Lys Lys Cys Val Val Leu 
195 200 205 


<210> 2 
<211> 207 
<212> PRT 
<213> Ashbya gossypii 

<400> 2 

Met Ser Gln Gln Met His Asn Pro Ser Ile Arg Arg Lys Leu Val Ile 
1 5 10 15 

Val Gly Asp Gly Ala Cys Gly Lys Thr Cys Leu Leu Ile Val Phe Ala 
20 25 30 

Lys Gly Lys Phe Pro Gln Val Tyr Val Pro Thr Val Phe Asp Asn Tyr 
35 40 45 

Val Ala Asp Val Glu Val Asp Gly Arg Arg Val Glu Leu Ala Leu Trp 
50 55 60 

Asp Thr Ala Gly Gln Glu Asp Tyr Asp Arg Leu Arg Pro Leu Ser Tyr 
65 70 75 80 

Pro Asp Ser Asn Val Val Leu Ile Cys Tyr Ser Ile Asp Leu Pro Asp 
85 90 95 

Ser Leu Glu Asn Val Met Glu Lys Trp Ile Ser Glu Val Leu Tyr Phe 
100 105 110 

Cys Gln Gly Val Pro Ile Ile Leu Val Gly Cys Lys Ala Asp Leu Arg 
115 120 125 

Asn Asp Pro Gln Val Ile Glu Gln Leu Arg Gln Gln Gly Gln Gln Pro 
130 135 140 

Val Ser Gln Ala Gln Ala Gln Glu Val Ala Asp Gln Ile Gly Ala Val 
145 150 155 160 

Glu Tyr Ile Glu Cys Ser Ala Lys Thr Gly Phe Gly Val Arg Glu Val 
165 170 175 

Phe Glu Ala Ala Thr Arg Ala Ser Leu Met Gly Lys Gln Gly Lys Ser 
180 185 190 

Lys Ala Lys Ser Asp Lys Lys Lys Lys Lys Lys Cys Val Val Leu 
195 200 205 


<210> 3 
<211> 675 
<212> DNA 
<213> Ashbya gossypii 

<220> 
<221> CDS 
<222> (1)..(675) 

<400> 3 

atg cct ctg tgt ggg tcg agc tcg tcg tcg aag cat cct atc gag cgc 48 
Met Pro Leu Cys Gly Ser Ser Ser Ser Ser Lys His Pro Ile Glu Arg 
1 5 10 15 

aag atc gtc atc ctc gga gac ggt gct tgc ggg aag acg tcg ctg ttg 96 
Lys Ile Val Ile Leu Gly Asp Gly Ala Cys Gly Lys Thr Ser Leu Leu 
20 25 30 

aac gtg ttc acg cga ggg tac ttt ccg aag gtg tac gag ccc acg gta 144 
Asn Val Phe Thr Arg Gly Tyr Phe Pro Lys Val Tyr Glu Pro Thr Val 
35 40 45 

ttc gaa aac tac atc cat gac atc ttc gtg gac aac cag cac atc acg 192 
Phe Glu Asn Tyr Ile His Asp Ile Phe Val Asp Asn Gln His Ile Thr 
50 55 60 

ctg agc ctg tgg gac act gct ggg cag gag gag ttt gac cgg ttg cga 240 
Leu Ser Leu Trp Asp Thr Ala Gly Gln Glu Glu Phe Asp Arg Leu Arg 
65 70 75 80 

tcg ctg tcg tac tcg gac aca cac acg att atg ctg tgt ttc tcg gtg 288 
Ser Leu Ser Tyr Ser Asp Thr His Thr Ile Met Leu Cys Phe Ser Val 
85 90 95 

gac tcg cgg gac tcg ctg gag aac gtc aag aac aag tgg gtg agc gaa 336 
Asp Ser Arg Asp Ser Leu Glu Asn Val Lys Asn Lys Trp Val Ser Glu 
100 105 110 

att gcg gac cac tgc gag ggc gtg aag ctg gtg cta gtg gcg ctg aag 384 
Ile Ala Asp His Cys Glu Gly Val Lys Leu Val Leu Val Ala Leu Lys 
115 120 125 

tgc gac ttg cgc agc agc gac gag tac ggc aac gag agc gcc atc acg 432 
Cys Asp Leu Arg Ser Ser Asp Glu Tyr Gly Asn Glu Ser Ala Ile Thr 
130 135 140 

ccg ggg tcc atc cag aac cag aag tac aac ggc ggc ggc ggc aac ggg 480 
Pro Gly Ser Ile Gln Asn Gln Lys Tyr Asn Gly Gly Gly Gly Asn Gly 
145 150 155 160 

ctg atc ccc tac gac gag ggg ctg gcg atg gcc aag cag att ggg gcg 528 
Leu Ile Pro Tyr Asp Glu Gly Leu Ala Met Ala Lys Gln Ile Gly Ala 
165 170 175 

ctg cgc tat ctg gag tgc agc gcc aag atg aac cgt ggc gtg aac gag 576 
Leu Arg Tyr Leu Glu Cys Ser Ala Lys Met Asn Arg Gly Val Asn Glu 
180 185 190 

gcg ttc acc gag gct gcg cgc tgc gcg ctg act gcg aca ccg aag ggg 624 
Ala Phe Thr Glu Ala Ala Arg Cys Ala Leu Thr Ala Thr Pro Lys Gly 
195 200 205 

gcc cgg gac tct gcg ccc gag gcc gaa agc agc agt tgt act atc atg 672 
Ala Arg Asp Ser Ala Pro Glu Ala Glu Ser Ser Ser Cys Thr Ile Met 
210 215 220 

tga 675 

225 



<210> 4 
<211> 224 
<212> PRT 
<213> Ashbya gossypii 

<400> 4 

Met Pro Leu Cys Gly Ser Ser Ser Ser Ser Lys His Pro Ile Glu Arg 
1 5 10 15 

Lys Ile Val Ile Leu Gly Asp Gly Ala Cys Gly Lys Thr Ser Leu Leu 
20 25 30 

Asn Val Phe Thr Arg Gly Tyr Phe Pro Lys Val Tyr Glu Pro Thr Val 
35 40 45 

Phe Glu Asn Tyr Ile His Asp Ile Phe Val Asp Asn Gln His Ile Thr 
50 55 60 

Leu Ser Leu Trp Asp Thr Ala Gly Gln Glu Glu Phe Asp Arg Leu Arg 
65 70 75 80 

Ser Leu Ser Tyr Ser Asp Thr His Thr Ile Met Leu Cys Phe Ser Val 
85 90 95 

Asp Ser Arg Asp Ser Leu Glu Asn Val Lys Asn Lys Trp Val Ser Glu 
100 105 110 

Ile Ala Asp His Cys Glu Gly Val Lys Leu Val Leu Val Ala Leu Lys 
115 120 125 

Cys Asp Leu Arg Ser Ser Asp Glu Tyr Gly Asn Glu Ser Ala Ile Thr 
130 135 140 

Pro Gly Ser Ile Gln Asn Gln Lys Tyr Asn Gly Gly Gly Gly Asn Gly 
145 150 155 160 

Leu Ile Pro Tyr Asp Glu Gly Leu Ala Met Ala Lys Gln Ile Gly Ala 
165 170 175 

Leu Arg Tyr Leu Glu Cys Ser Ala Lys Met Asn Arg Gly Val Asn Glu 
180 185 190 

Ala Phe Thr Glu Ala Ala Arg Cys Ala Leu Thr Ala Thr Pro Lys Gly 
195 200 205 

Ala Arg Asp Ser Ala Pro Glu Ala Glu Ser Ser Ser Cys Thr Ile Met 
210 215 220 


<210> 5 
<211> 6216 
<212> DNA 
<213> Ashbya gossypii 

<220> 
<221> CDS 
<222> (1)..(6216) 

<400> 5 

atg ccc cta aag tgg gct gcg aga aac aag aag cca cca tct gcg ccg 48 
Met Pro Leu Lys Trp Ala Ala Arg Asn Lys Lys Pro Pro Ser Ala Pro 
1 5 10 15 

cag tcg tgc gca agc aag ccg tcc agt gcg tcg cag tca tcc tgc gtt 96 
Gln Ser Cys Ala Ser Lys Pro Ser Ser Ala Ser Gln Ser Ser Cys Val 
20 25 30 

gac gag cgc atc agc gcg acg ccg cgg agc tcg atc tcg tcg aat tca 144 
Asp Glu Arg Ile Ser Ala Thr Pro Arg Ser Ser Ile Ser Ser Asn Ser 
35 40 45 

agc cct aat tcc aaa aat aat atg tcg cgt cat tcg cac tcc aat gga 192 
Ser Pro Asn Ser Lys Asn Asn Met Ser Arg His Ser His Ser Asn Gly 
50 55 60 

tct gtt tac tca gat gaa aca aca ttg aag aca gcc caa acc cac tac 240 
Ser Val Tyr Ser Asp Glu Thr Thr Leu Lys Thr Ala Gln Thr His Tyr 
65 70 75 80 

aca caa caa ggc caa cag gca aag ccg caa cag cac acg cag cag cag 288 
Thr Gln Gln Gly Gln Gln Ala Lys Pro Gln Gln His Thr Gln Gln Gln 
85 90 95 

cag cag cag cca cag acg ccg atg cag tta cag gtg ccg acg ggg caa 336 
Gln Gln Gln Pro Gln Thr Pro Met Gln Leu Gln Val Pro Thr Gly Gln 
100 105 110 

gcg cac aag cgg acg ctg aca tgt gag gac atg aag gcg ggt gcg cgc 384 
Ala His Lys Arg Thr Leu Thr Cys Glu Asp Met Lys Ala Gly Ala Arg 
115 120 125 

tgc gag gag cag gtg tcg ccc tgc tcg cag ccg gcg ggc tcg ccg gtg 432 
Cys Glu Glu Gln Val Ser Pro Cys Ser Gln Pro Ala Gly Ser Pro Val 
130 135 140 

cga cgt gga ggc ggg ctg aac ggg gag acg tac gac ggg act gtg ttt 480 
Arg Arg Gly Gly Gly Leu Asn Gly Glu Thr Tyr Asp Gly Thr Val Phe 
145 150 155 160 

cgg ctc ggg tgg gtg aac aag gcg cag ggc gca gcg ccg gcg cgc gag 528 
Arg Leu Gly Trp Val Asn Lys Ala Gln Gly Ala Ala Pro Ala Arg Glu 
165 170 175 

ggg cga tac agc cac cag cca aca gcg tca ctg tct tcg atc gga tcg 576 
Gly Arg Tyr Ser His Gln Pro Thr Ala Ser Leu Ser Ser Ile Gly Ser 
180 185 190 

gag cgg ccg cac ttc acg gga ggg ggg acg agc ggg tac cag tat gtc 624 
Glu Arg Pro His Phe Thr Gly Gly Gly Thr Ser Gly Tyr Gln Tyr Val 
195 200 205 

gcg act gcg tac cgg ttg cac cgt gcg cag ctc aag ggc tgc atc ctg 672 
Ala Thr Ala Tyr Arg Leu His Arg Ala Gln Leu Lys Gly Cys Ile Leu 
210 215 220 

aat ctg tac aag tcg ggc ctg acg aat gtg aag tac ttc gac ccg gcg 720 
Asn Leu Tyr Lys Ser Gly Leu Thr Asn Val Lys Tyr Phe Asp Pro Ala 
225 230 235 240 

ctg gag ccg agc gct gcg gcg ctg cag atg cac cag gag cga cag gag 768 
Leu Glu Pro Ser Ala Ala Ala Leu Gln Met His Gln Glu Arg Gln Glu 
245 250 255 

atg ccc ctc ctg cag ccg ccc ctc ccc tcc gag gct gtg ccg gcg cct 816 
Met Pro Leu Leu Gln Pro Pro Leu Pro Ser Glu Ala Val Pro Ala Pro 
260 265 270 

tcg atc ctg gag gcg tcc atg gag agc ggc gag ctg cgg ctg gag tac 864 
Ser Ile Leu Glu Ala Ser Met Glu Ser Gly Glu Leu Arg Leu Glu Tyr 
275 280 285 

ctg agc gag gcg tac cct cat ccg gac cta cag ctg gac aag aag gac 912 
Leu Ser Glu Ala Tyr Pro His Pro Asp Leu Gln Leu Asp Lys Lys Asp 
290 295 300 

ggc aag atc ctt tcg ggg tcg ctg gag tcg ctg tgc cac gcc gtg ctg 960 
Gly Lys Ile Leu Ser Gly Ser Leu Glu Ser Leu Cys His Ala Val Leu 
305 310 315 320 

ttc atg ccc acg act gac gcg aaa cgg gtc aca gac atc ttg ttg ctc 1008 
Phe Met Pro Thr Thr Asp Ala Lys Arg Val Thr Asp Ile Leu Leu Leu 
325 330 335 

ctg ccg ctc ctg gac gac ttc acg cgt gtc ctc aac tac ttc aac ctg 1056 
Leu Pro Leu Leu Asp Asp Phe Thr Arg Val Leu Asn Tyr Phe Asn Leu 
340 345 350 

ttc ggg aag gta ttt tcg aag cac cac ccg gcg ggc gcg gcg gga gcc 1104 
Phe Gly Lys Val Phe Ser Lys His His Pro Ala Gly Ala Ala Gly Ala 
355 360 365 

gat gac cta aat cag aac tac aac atc agc aac gag aca gac cgc caa 1152 
Asp Asp Leu Asn Gln Asn Tyr Asn Ile Ser Asn Glu Thr Asp Arg Gln 
370 375 380 

ttg acg ctg cgg cta gcc aca gtg gtc cag aca gtg ctg gac atg ttc 1200 
Leu Thr Leu Arg Leu Ala Thr Val Val Gln Thr Val Leu Asp Met Phe 
385 390 395 400 

ccg ggc ttt ctg ctg gac gac aag att ttc cag tcc ctg gtg ata cta 1248 
Pro Gly Phe Leu Leu Asp Asp Lys Ile Phe Gln Ser Leu Val Ile Leu 
405 410 415 

ctc gat acg att tcc ttc cac gat gaa gac acg tcg cag gag ctg aag 1296 
Leu Asp Thr Ile Ser Phe His Asp Glu Asp Thr Ser Gln Glu Leu Lys 
420 425 430 

gtg gcg ata gcg gag aaa cag acg gta ctg gtc aag ctg acc ggc ttt 1344 
Val Ala Ile Ala Glu Lys Gln Thr Val Leu Val Lys Leu Thr Gly Phe 
435 440 445 

gca aat gaa ccc atc cag tcc gcg aaa ctc gat gtt tta ata aag gtg 1392 
Ala Asn Glu Pro Ile Gln Ser Ala Lys Leu Asp Val Leu Ile Lys Val 
450 455 460 

cag agc ttc ctg aaa ctt gat acc gag aag gtt gcc aac cag att cac 1440 
Gln Ser Phe Leu Lys Leu Asp Thr Glu Lys Val Ala Asn Gln Ile His 
465 470 475 480 

aag atc aat cta acc ttt aat agg gta tgg agc cca caa gcc gat tat 1488 
Lys Ile Asn Leu Thr Phe Asn Arg Val Trp Ser Pro Gln Ala Asp Tyr 
485 490 495 

tcc cta ctt tac gac tct caa tat aca caa aag cac gtg gaa cta aac 1536 
Ser Leu Leu Tyr Asp Ser Gln Tyr Thr Gln Lys His Val Glu Leu Asn 
500 505 510 

cca ttg gta ttt ttc aac gat aaa aat gta cag tat ttg agt cgc tta 1584 
Pro Leu Val Phe Phe Asn Asp Lys Asn Val Gln Tyr Leu Ser Arg Leu 
515 520 525 

atg gtg tct cat atc ttc tgc gaa gag acg gga ttt acg ccg aag aaa 1632 
Met Val Ser His Ile Phe Cys Glu Glu Thr Gly Phe Thr Pro Lys Lys 
530 535 540 

cga gcg gag gtt ttg aca aaa tgg gtc caa ttg gga tgc aag ttt gag 1680 
Arg Ala Glu Val Leu Thr Lys Trp Val Gln Leu Gly Cys Lys Phe Glu 
545 550 555 560 






cga ctt ggg gac atg gtc tea tgg ctt gca att gcg aca gta ata tgc 1 728 
Arg Leu Gly Asp Met Val Ser Trp Leu Ala lie Ala Thr Val lie Cys 
565 570 575 

tec ate cec gtt tta cgc ttg aca agg acg tgg caa tat gtg cct gac 1776 
Ser lie Pro Val Leu Arg Leu Thr Arg Thr Trp Gin Tyr Val Pro Asp 
580 585 590 

tct tac ttg aag ata att ttt aag gat tgg gta cce acg att gtc cag 1 824 
Ser Tyr Leu Lys lie lie Phe Lys Asp Trp Val Pro Thr lie Val Gin 
595 600 605 

ttg gat cgc agg caa atg tec tee aag teg atg aac agt gtt tte ata 1 872 
Leu Asp Arg Arg Gin Met Ser Ser Lys Ser Met Asn Ser Val Phe lie 
610 615 620 

cta gcc cca cct aat tta aac gat gcc ttt gtg agg gac aat gtg atc 1920
Leu Ala Pro Pro Asn Leu Asn Asp Ala Phe Val Arg Asp Asn Val Ile
625 630 635 640 

cct tac ttt ggc gac tta gtc att cac tcc gat gat cta ccc aga gac 1968
Pro Tyr Phe Gly Asp Leu Val Ile His Ser Asp Asp Leu Pro Arg Asp
645 650 655 

agc aag tat aag tac ttg gag aaa aag ata cgc cgc aca aaa aat gcc 2016
Ser Lys Tyr Lys Tyr Leu Glu Lys Lys Ile Arg Arg Thr Lys Asn Ala
660 665 670 

ttt tac aag tgg cag cag aga cta gac cag gca ttt gcg cag gat aga 2064
Phe Tyr Lys Trp Gln Gln Arg Leu Asp Gln Ala Phe Ala Gln Asp Arg
675 680 685

gat tct gcc agt tcc ttt acg gac tcc ttg cat ctt gac gag gag gaa 2112
Asp Ser Ala Ser Ser Phe Thr Asp Ser Leu His Leu Asp Glu Glu Glu
690 695 700 

cat gat gtg gca gat ttc tat cag tat tgg agg ttt cac atg aat ttg 2160
His Asp Val Ala Asp Phe Tyr Gln Tyr Trp Arg Phe His Met Asn Leu
705 710 715 720 

cca cca atg aat att gaa aca att atg gaa atg agt tta aaa atg gaa 2208 
Pro Pro Met Asn Ile Glu Thr Ile Met Glu Met Ser Leu Lys Met Glu
725 730 735 

ccc cct tct att aat caa cag act tat tcg aag aca tac tca acg aga 2256
Pro Pro Ser Ile Asn Gln Gln Thr Tyr Ser Lys Thr Tyr Ser Thr Arg
740 745 750 

agt gcg ctc atc agt ggg gct tat ttg ccg acc ttg ttt aca aca ttg 2304
Ser Ala Leu Ile Ser Gly Ala Tyr Leu Pro Thr Leu Phe Thr Thr Leu
755 760 765 

tta cca tca tat tcc ctg ttt cca cag gaa cta ctg att gca gct gca 2352
Leu Pro Ser Tyr Ser Leu Phe Pro Gln Glu Leu Leu Ile Ala Ala Ala
770 775 780 

agc acg cca tcc acg aaa aat aat aac tca tct caa gcc tct aac cgg 2400
Ser Thr Pro Ser Thr Lys Asn Asn Asn Ser Ser Gln Ala Ser Asn Arg
785 790 795 800 

atc agc caa cta tct gtg aat tcg aca cct cac tca aat gcc agt tcg 2448
Ile Ser Gln Leu Ser Val Asn Ser Thr Pro His Ser Asn Ala Ser Ser
805 810 815 

agt tcc gca gcg agc gct gtt acc gga att gat aat atc gat gtg cca 2496
Ser Ser Ala Ala Ser Ala Val Thr Gly Ile Asp Asn Ile Asp Val Pro
820 825 830 

att aca aag gag ata tca tcc aag tta tca aac aaa cag gtt tta ctg 2544
Ile Thr Lys Glu Ile Ser Ser Lys Leu Ser Asn Lys Gln Val Leu Leu
835 840 845 

aag ttc att agg gat atg ttc aac gta gat att aac gtt ttc cac ata 2592
Lys Phe Ile Arg Asp Met Phe Asn Val Asp Ile Asn Val Phe His Ile
850 855 860 

tct gat gat gtt att ttc aag tcc att cgt gat tac gaa gct aaa tcg 2640
Ser Asp Asp Val Ile Phe Lys Ser Ile Arg Asp Tyr Glu Ala Lys Ser
865 870 875 880 

agg cct act agt gtc gtt att gaa agt ccc aag cgg ttg tcg ctt ctt 2688
Arg Pro Thr Ser Val Val Ile Glu Ser Pro Lys Arg Leu Ser Leu Leu
885 890 895 

tct tcg gtc tct cct gat gta tct gct gtc agc agt gca ttg gaa aat 2736
Ser Ser Val Ser Pro Asp Val Ser Ala Val Ser Ser Ala Leu Glu Asn 
900 905 910 

ttg gat ctg ttc aaa aat ttt aac tcc agt tct gat gat atc gcc gaa 2784
Leu Asp Leu Phe Lys Asn Phe Asn Ser Ser Ser Asp Asp Ile Ala Glu
915 920 925 

ttt acc gta cag gtg gtg ttg aaa tgt gca agc ttg gaa aag att ttt 2832
Phe Thr Val Gln Val Val Leu Lys Cys Ala Ser Leu Glu Lys Ile Phe
930 935 940 

gat atc ttg gtc tta aca agc cgg gtg ttc tcc aac ctc gta aca act 2880
Asp Ile Leu Val Leu Thr Ser Arg Val Phe Ser Asn Leu Val Thr Thr
945 950 955 960

aca gat ttg gtt tcc tat ttt aat agt gaa aag gca agg cgg gaa aag 2928
Thr Asp Leu Val Ser Tyr Phe Asn Ser Glu Lys Ala Arg Arg Glu Lys 
965 970 975 

tca ggc gct caa cac aat ggt cag cac tct att ggt ttg tta gat ttt 2976
Ser Gly Ala Gln His Asn Gly Gln His Ser Ile Gly Leu Leu Asp Phe
980 985 990 

gca ttg att agc cta att atg gat aat gag ctc ttt gca gag acc ttt 3024
Ala Leu Ile Ser Leu Ile Met Asp Asn Glu Leu Phe Ala Glu Thr Phe
995 1000 1005 

ttt aac aac tac aaa agt ttt acg acg acg ttg tgc gta ctg gaa aac 3072
Phe Asn Asn Tyr Lys Ser Phe Thr Thr Thr Leu Cys Val Leu Glu Asn 
1010 1015 1020 

ttg gca aag aga ttt atc ggt gcg aaa tcc tca gcc ata tct att agt 3120
Leu Ala Lys Arg Phe Ile Gly Ala Lys Ser Ser Ala Ile Ser Ile Ser
1025 1030 1035 1040 

cta atc aat aag tta cgg aat tct gaa tca tcc cgg cag ata cca cct 3168
Leu Ile Asn Lys Leu Arg Asn Ser Glu Ser Ser Arg Gln Ile Pro Pro
1045 1050 1055 

tct act acc tcc aac cag ttt tca gcg agt ggc atc ttt aag cca tca 3216
Ser Thr Thr Ser Asn Gln Phe Ser Ala Ser Gly Ile Phe Lys Pro Ser
1060 1065 1070 

tat gat gag ctt aaa ttc cct gtc tgg gat ctt aag gtc acc agc gtc 3264
Tyr Asp Glu Leu Lys Phe Pro Val Trp Asp Leu Lys Val Thr Ser Val 
1075 1080 1085

gaa ggc tgt ccg cta gac tac ctt gca aag att cag atc gga gta ttg 3312
Glu Gly Cys Pro Leu Asp Tyr Leu Ala Lys Ile Gln Ile Gly Val Leu
1090 1095 1100 

gaa tca cta tac cat ttg att aga gag cat tat gcg gac ttc acc gat 3360
Glu Ser Leu Tyr His Leu Ile Arg Glu His Tyr Ala Asp Phe Thr Asp
1105 1110 1115 1120 

gat ctc gct aac aac aaa acc ttt ctg gat att ctg aag atc att aac 3408
Asp Leu Ala Asn Asn Lys Thr Phe Leu Asp Ile Leu Lys Ile Ile Asn
1125 1130 1135 

cag gag gtt tat gat gag tgg gac aaa aga tta gat gac cta agg aat 3456
Gln Glu Val Tyr Asp Glu Trp Asp Lys Arg Leu Asp Asp Leu Arg Asn
1140 1145 1150 

aat aat aac agt agc cag aag agg aag aac agt tgc gat gat aat tct 3504
Asn Asn Asn Ser Ser Gln Lys Arg Lys Asn Ser Cys Asp Asp Asn Ser
1155 1160 1165 

agt gcc aag att act ttc cat gtt aat gat gct cga cct gaa aac tcc 3552
Ser Ala Lys Ile Thr Phe His Val Asn Asp Ala Arg Pro Glu Asn Ser
1170 1175 1180 

aat gag aat aag cgg ggt gcg gcg acg aat ttg ggg gat agc tcc tta 3600
Asn Glu Asn Lys Arg Gly Ala Ala Thr Asn Leu Gly Asp Ser Ser Leu 
1185 1190 1195 1200 

gca gca ttg gaa aaa ctt caa tgt aca tta cag gat cta tac gtg aag 3648
Ala Ala Leu Glu Lys Leu Gln Cys Thr Leu Gln Asp Leu Tyr Val Lys
1205 1210 1215

att aag tcc tca tat caa cgc caa tta tat cgt cca ttg ggc gtc aca 3696
Ile Lys Ser Ser Tyr Gln Arg Gln Leu Tyr Arg Pro Leu Gly Val Thr
1220 1225 1230 

aga aat tgc agg aaa gtt cac gat atg ctg tgc caa ttt cag ccg cag 3744 
Arg Asn Cys Arg Lys Val His Asp Met Leu Cys Gln Phe Gln Pro Gln
1235 1240 1245 

act agt atg tcc gct ctt atc atg aat gga tct agt gac aca ctt gat 3792
Thr Ser Met Ser Ala Leu Ile Met Asn Gly Ser Ser Asp Thr Leu Asp
1250 1255 1260 

aag atg gtt acc gaa ttc cag gcc ctg aaa cac acc gat tat gat gat 3840
Lys Met Val Thr Glu Phe Gln Ala Leu Lys His Thr Asp Tyr Asp Asp
1265 1270 1275 1280 

att att aat tgg att tac aaa tta gat cat ttt att acc tcg aaa cta 3888
Ile Ile Asn Trp Ile Tyr Lys Leu Asp His Phe Ile Thr Ser Lys Leu
1285 1290 1295 

aag ctt gtt tcg aac caa gac tgg att caa gtg tcg caa att tta gag 3936
Lys Leu Val Ser Asn Gln Asp Trp Ile Gln Val Ser Gln Ile Leu Glu
1300 1305 1310 

tct ttg tcg aat gat tct ctt gtt gct ttg ttc aat tat cca ttg cat 3984
Ser Leu Ser Asn Asp Ser Leu Val Ala Leu Phe Asn Tyr Pro Leu His 
1315 1320 1325 

gcg gaa tct aat aat gta att gca agt gga agt tct cag ttg gat gat 4032 
Ala Glu Ser Asn Asn Val Ile Ala Ser Gly Ser Ser Gln Leu Asp Asp
1330 1335 1340 

ctt caa att ttg gat ata ttc acc tgg tta tca acg ctt gag agt ggt 4080
Leu Gln Ile Leu Asp Ile Phe Thr Trp Leu Ser Thr Leu Glu Ser Gly
1345 1350 1355 1360 

tca gca cac att att gat aag ttc cct gct agc gtt cag ttg ata gtc 4128
Ser Ala His Ile Ile Asp Lys Phe Pro Ala Ser Val Gln Leu Ile Val
1365 1370 1375 

aga ctg cat ttg tct ctg act aaa ttt ttt act gtg cat att gcc cat 4176
Arg Leu His Leu Ser Leu Thr Lys Phe Phe Thr Val His Ile Ala His
1380 1385 1390 

ctg cat tct acc tat gag gcc aga gtt aat act tgt tca ctt atc ttg 4224
Leu His Ser Thr Tyr Glu Ala Arg Val Asn Thr Cys Ser Leu Ile Leu
1395 1400 1405 

gag ata ctc aac ttt gtt cat gtt aag aat gcc aat gtt aat tta ttc 4272
Glu Ile Leu Asn Phe Val His Val Lys Asn Ala Asn Val Asn Leu Phe
1410 1415 1420 

cat tct gat gat gct ggg gag ggt tct atg gcc aca att tcg cca cat 4320
His Ser Asp Asp Ala Gly Glu Gly Ser Met Ala Thr Ile Ser Pro His
1425 1430 1435 1440 

gtc cca tct ttc atc gaa aca gcc ata gaa aac gcc atc ata agt cca 4368
Val Pro Ser Phe Ile Glu Thr Ala Ile Glu Asn Ala Ile Ile Ser Pro
1445 1450 1455 

gaa tcc cga ttt ttt gag gtt tca tgg aag caa gcc tat aag aca ata 4416
Glu Ser Arg Phe Phe Glu Val Ser Trp Lys Gln Ala Tyr Lys Thr Ile
1460 1465 1470 

tcc gag aaa gat gag aag ttg acg ttc att gga tct gtg ctt acc ggg 4464
Ser Glu Lys Asp Glu Lys Leu Thr Phe Ile Gly Ser Val Leu Thr Gly
1475 1480 1485

tta gat aaa tcg acg gcg cac ttt ttg gat gcc gat aac agg cag cct 4512
Leu Asp Lys Ser Thr Ala His Phe Leu Asp Ala Asp Asn Arg Gln Pro
1490 1495 1500 

gtt agg ccc aag aat ttt tcg cct tgc ccg ggt tgg ttt atc tct cgt 4560
Val Arg Pro Lys Asn Phe Ser Pro Cys Pro Gly Trp Phe Ile Ser Arg
1505 1510 1515 1520 

ctg ttg gag atc act ggc cta gtt cct aac atg agc att gaa aat tcc 4608
Leu Leu Glu Ile Thr Gly Leu Val Pro Asn Met Ser Ile Glu Asn Ser
1525 1530 1535 

aaa atg atc aac ttt gac aaa agg cga ttc atc aat aac ata gtg ata 4656
Lys Met Ile Asn Phe Asp Lys Arg Arg Phe Ile Asn Asn Ile Val Ile
1540 1545 1550 

aac tat caa gac ttg att cca aat act gaa cag ctt ccg tct cat gat 4704
Asn Tyr Gln Asp Leu Ile Pro Asn Thr Glu Gln Leu Pro Ser His Asp
1555 1560 1565 

gat gaa aaa tcc gca cat caa ttt ggg tct atc ctt ttc cat tat ggc 4752
Asp Glu Lys Ser Ala His Gln Phe Gly Ser Ile Leu Phe His Tyr Gly
1570 1575 1580 

acc gag tca tcg att aag gca ttt aga aaa gct agt aag gag gct gct 4800
Thr Glu Ser Ser Ile Lys Ala Phe Arg Lys Ala Ser Lys Glu Ala Ala
1585 1590 1595 1600 

tca aat gag gca aga aaa ttg aag ttt caa gca atg ggc ttg ttc aat 4848
Ser Asn Glu Ala Arg Lys Leu Lys Phe Gln Ala Met Gly Leu Phe Asn
1605 1610 1615

gat atc cta gtc act gaa gtc tac aag gtg cag aga gat caa aag aaa 4896
Asp Ile Leu Val Thr Glu Val Tyr Lys Val Gln Arg Asp Gln Lys Lys
1620 1625 1630 

cag gaa cag tta acc gta cag gaa cat gag gca aaa aga tca gtc ttg 4944
Gln Glu Gln Leu Thr Val Gln Glu His Glu Ala Lys Arg Ser Val Leu
1635 1640 1645 

att caa cac cca aac aaa gtg tct gtc tct tcg gct tca tct tca gtc 4992
Ile Gln His Pro Asn Lys Val Ser Val Ser Ser Ala Ser Ser Ser Val
1650 1655 1660 

tct ggg tct tcc agt ggc tct act gct aga act tct aat cct gct cat 5040
Ser Gly Ser Ser Ser Gly Ser Thr Ala Arg Thr Ser Asn Pro Ala His 
1665 1670 1675 1680 

gct gct tac gcg tta aat atg gcc ggg tcc tta tca att tca gct gcc 5088
Ala Ala Tyr Ala Leu Asn Met Ala Gly Ser Leu Ser Ile Ser Ala Ala
1685 1690 1695 

aga cat ggt aga agc tct gtt tca tct agg agt tcg gta ata tca aat 5136
Arg His Gly Arg Ser Ser Val Ser Ser Arg Ser Ser Val Ile Ser Asn
1700 1705 1710 

acc gca act gct act tcc cca gca agt ggc gct tcc cca aac caa acc 5184
Thr Ala Thr Ala Thr Ser Pro Ala Ser Gly Ala Ser Pro Asn Gln Thr
1715 1720 1725 

agc acc tct cat cat ggg ggc atg ggt aaa aaa att ggt ggc ttt ttg 5232
Ser Thr Ser His His Gly Gly Met Gly Lys Lys Ile Gly Gly Phe Leu
1730 1735 1740

agg agg cca ttc tcc atc agt gga ttt acc tcg tca tcc tct caa tat 5280
Arg Arg Pro Phe Ser Ile Ser Gly Phe Thr Ser Ser Ser Ser Gln Tyr
1745 1750 1755 1760 

acc aca acg tca gtt gtg ctg tct ggc gtc cag gct aac ggc tct ata 5328
Thr Thr Thr Ser Val Val Leu Ser Gly Val Gln Ala Asn Gly Ser Ile
1765 1770 1775 

tcc cca tat gag cta ccc gaa ctc act tcc gaa ata caa gat aca aag 5376
Ser Pro Tyr Glu Leu Pro Glu Leu Thr Ser Glu Ile Gln Asp Thr Lys
1780 1785 1790 

atc gtc act gtc atc aag act ttt gag atc aaa tcg tgc atc caa atc 5424
Ile Val Thr Val Ile Lys Thr Phe Glu Ile Lys Ser Cys Ile Gln Ile
1795 1800 1805 

aac aac tac agg cag gat cct gat atg atg cat tgt ttt aag att gtt 5472
Asn Asn Tyr Arg Gln Asp Pro Asp Met Met His Cys Phe Lys Ile Val
1810 1815 1820 

atg gag gat ggt aca caa cat acc ctt caa tgt atg gac gac gct gat 5520
Met Glu Asp Gly Thr Gln His Thr Leu Gln Cys Met Asp Asp Ala Asp
1825 1830 1835 1840 

atg cat gaa tgg atg aag gcc att aca ctc tct aaa aga tac tcc ttc 5568
Met His Glu Trp Met Lys Ala Ile Thr Leu Ser Lys Arg Tyr Ser Phe
1845 1850 1855 

cat tct aaa aga ttt aag ggt aaa aca tca aat aaa atc ttc ggt gta 5616
His Ser Lys Arg Phe Lys Gly Lys Thr Ser Asn Lys Ile Phe Gly Val
1860 1865 1870 

ccg gta gaa gac gtt tgc gaa aga gaa gga gcg tta ata ccc aat att 5664
Pro Val Glu Asp Val Cys Glu Arg Glu Gly Ala Leu Ile Pro Asn Ile
1875 1880 1885 

ata gtg aaa ttg ttg gat gaa atc gag ttg cgc ggg ctt gat gaa gtg 5712
Ile Val Lys Leu Leu Asp Glu Ile Glu Leu Arg Gly Leu Asp Glu Val
1890 1895 1900 

ggc cta tat agg gtg cct ggt tcc gtg ggc agc atc aat gca ctc aag 5760
Gly Leu Tyr Arg Val Pro Gly Ser Val Gly Ser Ile Asn Ala Leu Lys
1905 1910 1915 1920 

aat gca ttt gac gat gag ggg gct gtt cac aac act ttt acg ctg gaa 5808
Asn Ala Phe Asp Asp Glu Gly Ala Val His Asn Thr Phe Thr Leu Glu 
1925 1930 1935 

gat gac cgt tgg ttt gaa ata aat act att gcc ggg tgt ttt aaa cta 5856
Asp Asp Arg Trp Phe Glu Ile Asn Thr Ile Ala Gly Cys Phe Lys Leu
1940 1945 1950 

tac ctc agg gaa ctt cct gaa tct ttg ttc aca aat gaa aag gtg gac 5904
Tyr Leu Arg Glu Leu Pro Glu Ser Leu Phe Thr Asn Glu Lys Val Asp 
1955 1960 1965 

gag ttc gtt aat atc atg acc gct tac aag aac cat gag gtt gat cta 5952
Glu Phe Val Asn Ile Met Thr Ala Tyr Lys Asn His Glu Val Asp Leu
1970 1975 1980 

tcc cag ttc cag aat ggt ata aag acg ctg ctg agt acc ttg cct gtt 6000
Ser Gln Phe Gln Asn Gly Ile Lys Thr Leu Leu Ser Thr Leu Pro Val
1985 1990 1995 2000 

ttc aat tac cat att cta aaa cgg ctg ttc ttg cat ctc aac cgc gtt 6048
Phe Asn Tyr His Ile Leu Lys Arg Leu Phe Leu His Leu Asn Arg Val



2005 2010 2015 

cac cag cat gtt gag aat aac aga atg gat gct agc aac ttg gca att 6096
His Gln His Val Glu Asn Asn Arg Met Asp Ala Ser Asn Leu Ala Ile
2020 2025 2030 

gtg ttt tcg atg tct ttc atc aac caa gat gat ctt gcc agt acg atg 6144
Val Phe Ser Met Ser Phe Ile Asn Gln Asp Asp Leu Ala Ser Thr Met
2035 2040 2045 

ggg ccc act ttg ggt ttg ctg caa atg cta cta cag cat ctg att aga 6192
Gly Pro Thr Leu Gly Leu Leu Gln Met Leu Leu Gln His Leu Ile Arg
2050 2055 2060 

aac cca gag cat tac ttc acc tga 6216
Asn Pro Glu His Tyr Phe Thr
2065 2070



<210> 6

<211> 2071

<212> PRT

<213> Ashbya gossypii

<400> 6

Met Pro Leu Lys Trp Ala Ala Arg Asn Lys Lys Pro Pro Ser Ala Pro
1 5 10 15

Gln Ser Cys Ala Ser Lys Pro Ser Ser Ala Ser Gln Ser Ser Cys Val
20 25 30 

Asp Glu Arg Ile Ser Ala Thr Pro Arg Ser Ser Ile Ser Ser Asn Ser
35 40 45

Ser Pro Asn Ser Lys Asn Asn Met Ser Arg His Ser His Ser Asn Gly
50 55 60 

Ser Val Tyr Ser Asp Glu Thr Thr Leu Lys Thr Ala Gln Thr His Tyr
65 70 75 80 

Thr Gln Gln Gly Gln Gln Ala Lys Pro Gln Gln His Thr Gln Gln Gln
85 90 95 

Gln Gln Gln Pro Gln Thr Pro Met Gln Leu Gln Val Pro Thr Gly Gln
100 105 110 

Ala His Lys Arg Thr Leu Thr Cys Glu Asp Met Lys Ala Gly Ala Arg 
115 120 125 

Cys Glu Glu Gln Val Ser Pro Cys Ser Gln Pro Ala Gly Ser Pro Val
130 135 140 

Arg Arg Gly Gly Gly Leu Asn Gly Glu Thr Tyr Asp Gly Thr Val Phe 
145 150 155 160 

Arg Leu Gly Trp Val Asn Lys Ala Gln Gly Ala Ala Pro Ala Arg Glu
165 170 175 

Gly Arg Tyr Ser His Gln Pro Thr Ala Ser Leu Ser Ser Ile Gly Ser
180 185 190 

Glu Arg Pro His Phe Thr Gly Gly Gly Thr Ser Gly Tyr Gln Tyr Val
195 200 205 

Ala Thr Ala Tyr Arg Leu His Arg Ala Gln Leu Lys Gly Cys Ile Leu
210 215 220

Asn Leu Tyr Lys Ser Gly Leu Thr Asn Val Lys Tyr Phe Asp Pro Ala
225 230 235 240 

Leu Glu Pro Ser Ala Ala Ala Leu Gln Met His Gln Glu Arg Gln Glu
245 250 255 

Met Pro Leu Leu Gln Pro Pro Leu Pro Ser Glu Ala Val Pro Ala Pro
260 265 270 

Ser Ile Leu Glu Ala Ser Met Glu Ser Gly Glu Leu Arg Leu Glu Tyr
275 280 285 

Leu Ser Glu Ala Tyr Pro His Pro Asp Leu Gln Leu Asp Lys Lys Asp
290 295 300 

Gly Lys Ile Leu Ser Gly Ser Leu Glu Ser Leu Cys His Ala Val Leu
305 310 315 320 

Phe Met Pro Thr Thr Asp Ala Lys Arg Val Thr Asp Ile Leu Leu Leu
325 330 335 

Leu Pro Leu Leu Asp Asp Phe Thr Arg Val Leu Asn Tyr Phe Asn Leu 
340 345 350 

Phe Gly Lys Val Phe Ser Lys His His Pro Ala Gly Ala Ala Gly Ala 
355 360 365 

Asp Asp Leu Asn Gln Asn Tyr Asn Ile Ser Asn Glu Thr Asp Arg Gln
370 375 380 

Leu Thr Leu Arg Leu Ala Thr Val Val Gln Thr Val Leu Asp Met Phe
385 390 395 400

Pro Gly Phe Leu Leu Asp Asp Lys Ile Phe Gln Ser Leu Val Ile Leu
405 410 415 

Leu Asp Thr Ile Ser Phe His Asp Glu Asp Thr Ser Gln Glu Leu Lys
420 425 430 

Val Ala Ile Ala Glu Lys Gln Thr Val Leu Val Lys Leu Thr Gly Phe
435 440 445 

Ala Asn Glu Pro Ile Gln Ser Ala Lys Leu Asp Val Leu Ile Lys Val
450 455 460 

Gln Ser Phe Leu Lys Leu Asp Thr Glu Lys Val Ala Asn Gln Ile His
465 470 475 480 

Lys Ile Asn Leu Thr Phe Asn Arg Val Trp Ser Pro Gln Ala Asp Tyr
485 490 495 

Ser Leu Leu Tyr Asp Ser Gln Tyr Thr Gln Lys His Val Glu Leu Asn
500 505 510 

Pro Leu Val Phe Phe Asn Asp Lys Asn Val Gln Tyr Leu Ser Arg Leu
515 520 525 

Met Val Ser His Ile Phe Cys Glu Glu Thr Gly Phe Thr Pro Lys Lys
530 535 540 

Arg Ala Glu Val Leu Thr Lys Trp Val Gln Leu Gly Cys Lys Phe Glu
545 550 555 560 

Arg Leu Gly Asp Met Val Ser Trp Leu Ala Ile Ala Thr Val Ile Cys
565 570 575

Ser Ile Pro Val Leu Arg Leu Thr Arg Thr Trp Gln Tyr Val Pro Asp
580 585 590 

Ser Tyr Leu Lys Ile Ile Phe Lys Asp Trp Val Pro Thr Ile Val Gln
595 600 605 

Leu Asp Arg Arg Gln Met Ser Ser Lys Ser Met Asn Ser Val Phe Ile
610 615 620 

Leu Ala Pro Pro Asn Leu Asn Asp Ala Phe Val Arg Asp Asn Val Ile
625 630 635 640 

Pro Tyr Phe Gly Asp Leu Val Ile His Ser Asp Asp Leu Pro Arg Asp
645 650 655 

Ser Lys Tyr Lys Tyr Leu Glu Lys Lys Ile Arg Arg Thr Lys Asn Ala
660 665 670 

Phe Tyr Lys Trp Gln Gln Arg Leu Asp Gln Ala Phe Ala Gln Asp Arg
675 680 685 

Asp Ser Ala Ser Ser Phe Thr Asp Ser Leu His Leu Asp Glu Glu Glu 
690 695 700 

His Asp Val Ala Asp Phe Tyr Gln Tyr Trp Arg Phe His Met Asn Leu
705 710 715 720 

Pro Pro Met Asn Ile Glu Thr Ile Met Glu Met Ser Leu Lys Met Glu
725 730 735 

Pro Pro Ser Ile Asn Gln Gln Thr Tyr Ser Lys Thr Tyr Ser Thr Arg
740 745 750

Ser Ala Leu Ile Ser Gly Ala Tyr Leu Pro Thr Leu Phe Thr Thr Leu
755 760 765 

Leu Pro Ser Tyr Ser Leu Phe Pro Gln Glu Leu Leu Ile Ala Ala Ala
770 775 780 

Ser Thr Pro Ser Thr Lys Asn Asn Asn Ser Ser Gln Ala Ser Asn Arg
785 790 795 800 

Ile Ser Gln Leu Ser Val Asn Ser Thr Pro His Ser Asn Ala Ser Ser
805 810 815 

Ser Ser Ala Ala Ser Ala Val Thr Gly Ile Asp Asn Ile Asp Val Pro
820 825 830 

Ile Thr Lys Glu Ile Ser Ser Lys Leu Ser Asn Lys Gln Val Leu Leu
835 840 845 

Lys Phe Ile Arg Asp Met Phe Asn Val Asp Ile Asn Val Phe His Ile
850 855 860 

Ser Asp Asp Val Ile Phe Lys Ser Ile Arg Asp Tyr Glu Ala Lys Ser
865 870 875 880 

Arg Pro Thr Ser Val Val Ile Glu Ser Pro Lys Arg Leu Ser Leu Leu
885 890 895 

Ser Ser Val Ser Pro Asp Val Ser Ala Val Ser Ser Ala Leu Glu Asn 
900 905 910 

Leu Asp Leu Phe Lys Asn Phe Asn Ser Ser Ser Asp Asp Ile Ala Glu
915 920 925

Phe Thr Val Gln Val Val Leu Lys Cys Ala Ser Leu Glu Lys Ile Phe
930 935 940 

Asp Ile Leu Val Leu Thr Ser Arg Val Phe Ser Asn Leu Val Thr Thr
945 950 955 960 

Thr Asp Leu Val Ser Tyr Phe Asn Ser Glu Lys Ala Arg Arg Glu Lys 
965 970 975 

Ser Gly Ala Gln His Asn Gly Gln His Ser Ile Gly Leu Leu Asp Phe
980 985 990 

Ala Leu Ile Ser Leu Ile Met Asp Asn Glu Leu Phe Ala Glu Thr Phe
995 1000 1005 

Phe Asn Asn Tyr Lys Ser Phe Thr Thr Thr Leu Cys Val Leu Glu Asn 
1010 1015 1020 

Leu Ala Lys Arg Phe Ile Gly Ala Lys Ser Ser Ala Ile Ser Ile Ser
1025 1030 1035 1040

Leu Ile Asn Lys Leu Arg Asn Ser Glu Ser Ser Arg Gln Ile Pro Pro
1045 1050 1055 

Ser Thr Thr Ser Asn Gln Phe Ser Ala Ser Gly Ile Phe Lys Pro Ser
1060 1065 1070 

Tyr Asp Glu Leu Lys Phe Pro Val Trp Asp Leu Lys Val Thr Ser Val 
1075 1080 1085 

Glu Gly Cys Pro Leu Asp Tyr Leu Ala Lys Ile Gln Ile Gly Val Leu
1090 1095 1100

Glu Ser Leu Tyr His Leu Ile Arg Glu His Tyr Ala Asp Phe Thr Asp
1105 1110 1115 1120

Asp Leu Ala Asn Asn Lys Thr Phe Leu Asp Ile Leu Lys Ile Ile Asn
1125 1130 1135 

Gln Glu Val Tyr Asp Glu Trp Asp Lys Arg Leu Asp Asp Leu Arg Asn
1140 1145 1150 

Asn Asn Asn Ser Ser Gln Lys Arg Lys Asn Ser Cys Asp Asp Asn Ser
1155 1160 1165 

Ser Ala Lys Ile Thr Phe His Val Asn Asp Ala Arg Pro Glu Asn Ser
1170 1175 1180 

Asn Glu Asn Lys Arg Gly Ala Ala Thr Asn Leu Gly Asp Ser Ser Leu
1185 1190 1195 1200

Ala Ala Leu Glu Lys Leu Gln Cys Thr Leu Gln Asp Leu Tyr Val Lys
1205 1210 1215 

Ile Lys Ser Ser Tyr Gln Arg Gln Leu Tyr Arg Pro Leu Gly Val Thr
1220 1225 1230 

Arg Asn Cys Arg Lys Val His Asp Met Leu Cys Gln Phe Gln Pro Gln
1235 1240 1245 

Thr Ser Met Ser Ala Leu Ile Met Asn Gly Ser Ser Asp Thr Leu Asp
1250 1255 1260 

Lys Met Val Thr Glu Phe Gln Ala Leu Lys His Thr Asp Tyr Asp Asp
1265 1270 1275 1280

Ile Ile Asn Trp Ile Tyr Lys Leu Asp His Phe Ile Thr Ser Lys Leu
1285 1290 1295 

Lys Leu Val Ser Asn Gln Asp Trp Ile Gln Val Ser Gln Ile Leu Glu
1300 1305 1310 

Ser Leu Ser Asn Asp Ser Leu Val Ala Leu Phe Asn Tyr Pro Leu His 
1315 1320 1325 

Ala Glu Ser Asn Asn Val Ile Ala Ser Gly Ser Ser Gln Leu Asp Asp
1330 1335 1340 

Leu Gln Ile Leu Asp Ile Phe Thr Trp Leu Ser Thr Leu Glu Ser Gly
1345 1350 1355 1360

Ser Ala His Ile Ile Asp Lys Phe Pro Ala Ser Val Gln Leu Ile Val
1365 1370 1375 

Arg Leu His Leu Ser Leu Thr Lys Phe Phe Thr Val His Ile Ala His
1380 1385 1390 

Leu His Ser Thr Tyr Glu Ala Arg Val Asn Thr Cys Ser Leu Ile Leu
1395 1400 1405 

Glu Ile Leu Asn Phe Val His Val Lys Asn Ala Asn Val Asn Leu Phe
1410 1415 1420 

His Ser Asp Asp Ala Gly Glu Gly Ser Met Ala Thr Ile Ser Pro His
1425 1430 1435 1440

Val Pro Ser Phe Ile Glu Thr Ala Ile Glu Asn Ala Ile Ile Ser Pro
1445 1450 1455

Glu Ser Arg Phe Phe Glu Val Ser Trp Lys Gln Ala Tyr Lys Thr Ile
1460 1465 1470 

Ser Glu Lys Asp Glu Lys Leu Thr Phe Ile Gly Ser Val Leu Thr Gly
1475 1480 1485 

Leu Asp Lys Ser Thr Ala His Phe Leu Asp Ala Asp Asn Arg Gln Pro
1490 1495 1500 

Val Arg Pro Lys Asn Phe Ser Pro Cys Pro Gly Trp Phe Ile Ser Arg
1505 1510 1515 1520

Leu Leu Glu Ile Thr Gly Leu Val Pro Asn Met Ser Ile Glu Asn Ser
1525 1530 1535 

Lys Met Ile Asn Phe Asp Lys Arg Arg Phe Ile Asn Asn Ile Val Ile
1540 1545 1550 

Asn Tyr Gln Asp Leu Ile Pro Asn Thr Glu Gln Leu Pro Ser His Asp
1555 1560 1565 

Asp Glu Lys Ser Ala His Gln Phe Gly Ser Ile Leu Phe His Tyr Gly
1570 1575 1580 

Thr Glu Ser Ser Ile Lys Ala Phe Arg Lys Ala Ser Lys Glu Ala Ala
1585 1590 1595 1600

Ser Asn Glu Ala Arg Lys Leu Lys Phe Gln Ala Met Gly Leu Phe Asn
1605 1610 1615 

Asp Ile Leu Val Thr Glu Val Tyr Lys Val Gln Arg Asp Gln Lys Lys
1620 1625 1630

Gln Glu Gln Leu Thr Val Gln Glu His Glu Ala Lys Arg Ser Val Leu
1635 1640 1645 

Ile Gln His Pro Asn Lys Val Ser Val Ser Ser Ala Ser Ser Ser Val
1650 1655 1660 

Ser Gly Ser Ser Ser Gly Ser Thr Ala Arg Thr Ser Asn Pro Ala His 
1665 1670 1675 1680

Ala Ala Tyr Ala Leu Asn Met Ala Gly Ser Leu Ser Ile Ser Ala Ala
1685 1690 1695 

Arg His Gly Arg Ser Ser Val Ser Ser Arg Ser Ser Val Ile Ser Asn
1700 1705 1710 

Thr Ala Thr Ala Thr Ser Pro Ala Ser Gly Ala Ser Pro Asn Gln Thr
1715 1720 1725 

Ser Thr Ser His His Gly Gly Met Gly Lys Lys Ile Gly Gly Phe Leu
1730 1735 1740 

Arg Arg Pro Phe Ser Ile Ser Gly Phe Thr Ser Ser Ser Ser Gln Tyr
1745 1750 1755 1760

Thr Thr Thr Ser Val Val Leu Ser Gly Val Gln Ala Asn Gly Ser Ile
1765 1770 1775 

Ser Pro Tyr Glu Leu Pro Glu Leu Thr Ser Glu Ile Gln Asp Thr Lys
1780 1785 1790 

Ile Val Thr Val Ile Lys Thr Phe Glu Ile Lys Ser Cys Ile Gln Ile
1795 1800 1805

Asn Asn Tyr Arg Gln Asp Pro Asp Met Met His Cys Phe Lys Ile Val
1810 1815 1820 

Met Glu Asp Gly Thr Gln His Thr Leu Gln Cys Met Asp Asp Ala Asp
1825 1830 1835 1840

Met His Glu Trp Met Lys Ala Ile Thr Leu Ser Lys Arg Tyr Ser Phe
1845 1850 1855 

His Ser Lys Arg Phe Lys Gly Lys Thr Ser Asn Lys Ile Phe Gly Val
1860 1865 1870 

Pro Val Glu Asp Val Cys Glu Arg Glu Gly Ala Leu Ile Pro Asn Ile
1875 1880 1885 

Ile Val Lys Leu Leu Asp Glu Ile Glu Leu Arg Gly Leu Asp Glu Val
1890 1895 1900 

Gly Leu Tyr Arg Val Pro Gly Ser Val Gly Ser Ile Asn Ala Leu Lys
1905 1910 1915 1920

Asn Ala Phe Asp Asp Glu Gly Ala Val His Asn Thr Phe Thr Leu Glu 
1925 1930 1935 

Asp Asp Arg Trp Phe Glu Ile Asn Thr Ile Ala Gly Cys Phe Lys Leu
1940 1945 1950 

Tyr Leu Arg Glu Leu Pro Glu Ser Leu Phe Thr Asn Glu Lys Val Asp 
1955 1960 1965 

Glu Phe Val Asn Ile Met Thr Ala Tyr Lys Asn His Glu Val Asp Leu
1970 1975 1980

Ser Gln Phe Gln Asn Gly Ile Lys Thr Leu Leu Ser Thr Leu Pro Val
1985 1990 1995 2000

Phe Asn Tyr His Ile Leu Lys Arg Leu Phe Leu His Leu Asn Arg Val
2005 2010 2015 

His Gln His Val Glu Asn Asn Arg Met Asp Ala Ser Asn Leu Ala Ile
2020 2025 2030 

Val Phe Ser Met Ser Phe Ile Asn Gln Asp Asp Leu Ala Ser Thr Met
2035 2040 2045 

Gly Pro Thr Leu Gly Leu Leu Gln Met Leu Leu Gln His Leu Ile Arg
2050 2055 2060 

Asn Pro Glu His Tyr Phe Thr
2065 2070



<210> 7

<211> 3042

<212> DNA

<213> Ashbya gossypii

<220>

<221> CDS

<222> (1)..(3042)

<400> 7

atg gga gac ggg tca gac gca gaa cgc agc ggg ggg aca agc tcg tcc 48
Met Gly Asp Gly Ser Asp Ala Glu Arg Ser Gly Gly Thr Ser Ser Ser
1 5 10 15

tca gca ttg gaa ctt ctt gcg cag tat gag cag cac att atg gag cgg 96
Ser Ala Leu Glu Leu Leu Ala Gln Tyr Glu Gln His Ile Met Glu Arg
20 25 30 

ggg agg acg ttg gag gcg att gaa ggg cac ggc ggg gag cgg ctg ggg 144
Gly Arg Thr Leu Glu Ala Ile Glu Gly His Gly Gly Glu Arg Leu Gly
35 40 45 

cca acg tac gag gag ctt gtg gag gag aac gtg cag ctc cgg cgg gag 192
Pro Thr Tyr Glu Glu Leu Val Glu Glu Asn Val Gln Leu Arg Arg Glu
50 55 60 

ctg cag ggg cag cgg gag gaa ata gaa cac ctc cgc aaa acg att tct 240
Leu Gln Gly Gln Arg Glu Glu Ile Glu His Leu Arg Lys Thr Ile Ser
65 70 75 80 

ctg ctt gcg tcg ggg cgg agc ggc gcg acg gtg gtc gag cag cag gtg 288
Leu Leu Ala Ser Gly Arg Ser Gly Ala Thr Val Val Glu Gln Gln Val
85 90 95 

cgt cct gag cct tcg ccg tcc gta cga gag ctg gcg ctg ccg ccg cgg 336
Arg Pro Glu Pro Ser Pro Ser Val Arg Glu Leu Ala Leu Pro Pro Arg 
100 105 110 

tcc gcg gac cgg cga aag aac acc aaa aac ctg agt ctc gcc ccg gtg 384
Ser Ala Asp Arg Arg Lys Asn Thr Lys Asn Leu Ser Leu Ala Pro Val 
115 120 125 

ggc cac gag gtg ccg tcg acc gac cgg ctg cgt gtc tcg ccg cag gag 432
Gly His Glu Val Pro Ser Thr Asp Arg Leu Arg Val Ser Pro Gln Glu
130 135 140

gcc acg agc ggg gca cag cag gtg ccc ttg cta acc tct tcg aag tcc 480
Ala Thr Ser Gly Ala Gln Gln Val Pro Leu Leu Thr Ser Ser Lys Ser
145 150 155 160 

gcc gag att ctg gtg tcg aaa tct ccg gat gaa gac cgc cac ttg atg 528
Ala Glu Ile Leu Val Ser Lys Ser Pro Asp Glu Asp Arg His Leu Met
165 170 175 

tcg cct agg aag aca att tca cgg tcc agt tcg tca tat tcg aat acg 576
Ser Pro Arg Lys Thr Ile Ser Arg Ser Ser Ser Ser Tyr Ser Asn Thr
180 185 190 

cta ggc agc cct gca act tcc gtt ctg tat aag aac tct cgg ata tca 624
Leu Gly Ser Pro Ala Thr Ser Val Leu Tyr Lys Asn Ser Arg Ile Ser
195 200 205 

att act tct ccg tgc aag tct aac tct acg agc aaa gct gcg tct gtg 672
Ile Thr Ser Pro Cys Lys Ser Asn Ser Thr Ser Lys Ala Ala Ser Val
210 215 220 

ttg agt cta cca gaa aat aac acg tcc acc gag aat gcg ccg cat tca 720
Leu Ser Leu Pro Glu Asn Asn Thr Ser Thr Glu Asn Ala Pro His Ser 
225 230 235 240 

cca cac aga ata gac aac gaa ttg gac ttg ctc acc gtg gag cct caa 768
Pro His Arg Ile Asp Asn Glu Leu Asp Leu Leu Thr Val Glu Pro Gln
245 250 255 

gat gga agc agg tac gat aca gag aga gca ggt ggt ccg ggg cca ttg 816
Asp Gly Ser Arg Tyr Asp Thr Glu Arg Ala Gly Gly Pro Gly Pro Leu 
260 265 270 

tcg cct gag agc ata gtg tac agt gat tcg gac ttg caa gag cat caa 864
Ser Pro Glu Ser Ile Val Tyr Ser Asp Ser Asp Leu Gln Glu His Gln
275 280 285 

cct tct gat ctg tca tct acc act agg acg gat tta ggc aaa ttc aga 912
Pro Ser Asp Leu Ser Ser Thr Thr Arg Thr Asp Leu Gly Lys Phe Arg 
290 295 300 

gat atg gtg gat act acc ttc aat gca gaa gac aac cct acg ggt tca 960
Asp Met Val Asp Thr Thr Phe Asn Ala Glu Asp Asn Pro Thr Gly Ser 
305 310 315 320 

cga gac aag gag act gga acg gaa atg gag atc gct acg cta caa aat 1008
Arg Asp Lys Glu Thr Gly Thr Glu Met Glu Ile Ala Thr Leu Gln Asn
325 330 335 

acg ccc agc aga caa cat gaa tcg agt ttg gta aca agt cca caa gct 1056
Thr Pro Ser Arg Gln His Glu Ser Ser Leu Val Thr Ser Pro Gln Ala
340 345 350 

tct agg tca tcg att aca acg cca gtc gtg gat cct act aat acg agc 1104
Ser Arg Ser Ser Ile Thr Thr Pro Val Val Asp Pro Thr Asn Thr Ser
355 360 365 

gaa cct tct tcg ctt tca gca gcg aag ttt gga agt atg tct acc gct 1152
Glu Pro Ser Ser Leu Ser Ala Ala Lys Phe Gly Ser Met Ser Thr Ala 
370 375 380 

aca tcc tcg aac aaa agg tcc aag ggc atg ggc act cct tcc gtg gaa 1200
Thr Ser Ser Asn Lys Arg Ser Lys Gly Met Gly Thr Pro Ser Val Glu 
385 390 395 400 

cat tca gca aag tca tac tcg cag cat tct ggt agc ccc cac tct aac 1248
His Ser Ala Lys Ser Tyr Ser Gln His Ser Gly Ser Pro His Ser Asn
405 410 415

tct cac cag tcc aag aaa gca gat att ccc tta ttt gta cag cca gag 1296
Ser His Gln Ser Lys Lys Ala Asp Ile Pro Leu Phe Val Gln Pro Glu
420 425 430 

gag tta ggt acg atc agg att gag gtc att agt aca ttg tat cat gag 1344
Glu Leu Gly Thr Ile Arg Ile Glu Val Ile Ser Thr Leu Tyr His Glu
435 440 445 

cct gga aac gca gcc agc att ctc ttt agt gtt gtt gat aag aag tct 1392
Pro Gly Asn Ala Ala Ser Ile Leu Phe Ser Val Val Asp Lys Lys Ser
450 455 460 

tcc aag gag atg ttc aaa ttt gct aaa act ttt acc cgc att gca gag 1440
Ser Lys Glu Met Phe Lys Phe Ala Lys Thr Phe Thr Arg Ile Ala Glu
465 470 475 480 

ttc gat acc ttt atc aga aac aat atg gaa tct tta gcc gtc ccc ccc 1488
Phe Asp Thr Phe Ile Arg Asn Asn Met Glu Ser Leu Ala Val Pro Pro
485 490 495 

ctt ccc gac aag cac atg ttt gct tcg aac gtg cca gta aag gta gac 1536
Leu Pro Asp Lys His Met Phe Ala Ser Asn Val Pro Val Lys Val Asp
500 505 510 

agt agg aga gaa aag ctt aat gac tac ttt gct agt ttg ttg tat cta 1584
Ser Arg Arg Glu Lys Leu Asn Asp Tyr Phe Ala Ser Leu Leu Tyr Leu 
515 520 525 

tcc cca tta ccc ttt aat cca gca ttg aag tta gcg caa ttc att agc 1632
Ser Pro Leu Pro Phe Asn Pro Ala Leu Lys Leu Ala Gln Phe Ile Ser
530 535 540

aca gac cct gtt atg aac cct ata act ggc gaa ttt gct aaa gag ggc 1680
Thr Asp Pro Val Met Asn Pro Ile Thr Gly Glu Phe Ala Lys Glu Gly
545 550 555 560 

atg cta cta gtc cgt aaa tct aaa acc ttg ggt agt act act acg tgg 1728
Met Leu Leu Val Arg Lys Ser Lys Thr Leu Gly Ser Thr Thr Thr Trp 
565 570 575 

cgt att agg tac tgc aca gtt gag ggc tct ata atg cat ctc cat gac 1776
Arg Ile Arg Tyr Cys Thr Val Glu Gly Ser Ile Met His Leu His Asp
580 585 590 

cat atg att gat act gat acg atc aaa ttg acg cat tct acg att gaa 1824
His Met Ile Asp Thr Asp Thr Ile Lys Leu Thr His Ser Thr Ile Glu
595 600 605 

ctt cag gca aac ctc ccg gat gat aag tat ggg acc aag aat gga ttc 1872
Leu Gln Ala Asn Leu Pro Asp Asp Lys Tyr Gly Thr Lys Asn Gly Phe
610 615 620 

ata ctt aat gaa cac aaa aag agt ggt ctt tca agc tct aca aag tac 1920
Ile Leu Asn Glu His Lys Lys Ser Gly Leu Ser Ser Ser Thr Lys Tyr
625 630 635 640 

tat ttt tgc gc^ gaa acg cca aaa gaa cgt gaa caa tgg ata agc gta 1968
Tyr Phe Cys Ala Glu Thr Pro Lys Glu Arg Glu Gln Trp Ile Ser Val
645 650 655 

ttg acc act ctc tgc gat ggc cca ggt ggt aca gca gcc att cca tcc 2016
Leu Thr Thr Leu Cys Asp Gly Pro Gly Gly Thr Ala Ala Ile Pro Ser
660 665 670

att aat agc aag tct gaa gcg tct agt tta ttc gag caa aca agc att 2064
Ile Asn Ser Lys Ser Glu Ala Ser Ser Leu Phe Glu Gln Thr Ser Ile
675 680 685 

age gac tct agt tat ett gga eca att get aat cte gag gca atg gat 2112 
Ser Asp Ser Ser Tyr Leu Giy Pro Ile Ala Asn Leu Glu Ala Met Asp 
690 695 700 

gea aet tet ccg aea aga eca aat gat eca aac ccg gtc tec tta aca 21 60 
Ala Thr Ser Pro Thr Arg Pro Asn Asp Pro Asn Pro Val Ser Leu Thr 
705 710 715 720 

tct gaa gaa gag aaa gag gtc aag aga cga cgt atg aag tca ttc ttc 2208
Ser Glu Glu Glu Lys Glu Val Lys Arg Arg Arg Met Lys Ser Phe Phe
725 730 735

cct ttc aag aag tta gct act aca cct acc ccc tac gct gct gga aac 2256
Pro Phe Lys Lys Leu Ala Thr Thr Pro Thr Pro Tyr Ala Ala Gly Asn
740 745 750

gac aat gct tct ata ttt tcg caa gat gat gat agc cct gtg aat gct 2304
Asp Asn Ala Ser Ile Phe Ser Gln Asp Asp Asp Ser Pro Val Asn Ala
755 760 765

aca aat gaa agt ggt att tca aga tca ctc cag tcc atg aat tta caa 2352
Thr Asn Glu Ser Gly Ile Ser Arg Ser Leu Gln Ser Met Asn Leu Gln
770 775 780

gca cag tat aac gcg gta ttt gga gcg gac ttg aga tcc tgt tta caa 2400
Ala Gln Tyr Asn Ala Val Phe Gly Ala Asp Leu Arg Ser Cys Leu Gln
785 790 795 800

cta agt tcg cat ccc tac cag gga aaa tat gaa ata cca agt gtt gta 2448
Leu Ser Ser His Pro Tyr Gln Gly Lys Tyr Glu Ile Pro Ser Val Val
805 810 815 

ttc cga acg cta gaa ttc ttg tac aaa aac cgc ggc att cag gaa gaa 2496
Phe Arg Thr Leu Glu Phe Leu Tyr Lys Asn Arg Gly Ile Gln Glu Glu
820 825 830

ggt ata ttt agg tta agc gga tcc agt tct ctc ata aaa tct ttg cag 2544
Gly Ile Phe Arg Leu Ser Gly Ser Ser Ser Leu Ile Lys Ser Leu Gln
835 840 845

gag caa ttt gac aaa gaa tat gac gtg gat ttg tgc aat tac aac gat 2592
Glu Gln Phe Asp Lys Glu Tyr Asp Val Asp Leu Cys Asn Tyr Asn Asp
850 855 860

aaa gtt tct gtc aca cca gga aac gaa aat cag ggc ggt ctc tac gtc 2640
Lys Val Ser Val Thr Pro Gly Asn Glu Asn Gln Gly Gly Leu Tyr Val
865 870 875 880

gat gtg aat acc gtt tca ggt tta tta aaa cta tac cta aga aag ctt 2688
Asp Val Asn Thr Val Ser Gly Leu Leu Lys Leu Tyr Leu Arg Lys Leu
885 890 895

cct cat atg atc ttt ggg gat gct gca tat atg gat ttt aag aga atc 2736
Pro His Met Ile Phe Gly Asp Ala Ala Tyr Met Asp Phe Lys Arg Ile
900 905 910 

gtg gaa aga aac gga gat gat agc aaa cta ata gca ctc gag ttc agg 2784
Val Glu Arg Asn Gly Asp Asp Ser Lys Leu Ile Ala Leu Glu Phe Arg
915 920 925

gca ttg gtt aat tcc gga cga att gcc aaa gaa tat gtc gcc tta atg 2832
Ala Leu Val Asn Ser Gly Arg Ile Ala Lys Glu Tyr Val Ala Leu Met
930 935 940

tat gca ttg ttc gag tta ttg gtg aag atc acc gag aac agc aaa tat 2880
Tyr Ala Leu Phe Glu Leu Leu Val Lys Ile Thr Glu Asn Ser Lys Tyr
945 950 955 960

aac aag atg aat ctg cgg aat ttg tgt atc gta ttt tcg cca acg ttg 2928
Asn Lys Met Asn Leu Arg Asn Leu Cys Ile Val Phe Ser Pro Thr Leu
965 970 975

aac ata ccc gtg aat ata cta cat ccg ttt atc act gac ttt ggc tgt 2976
Asn Ile Pro Val Asn Ile Leu His Pro Phe Ile Thr Asp Phe Gly Cys
980 985 990

ata ttc caa gat aag gcg ccg atg gag aac gga cca ccg gtc aac ata 3024
Ile Phe Gln Asp Lys Ala Pro Met Glu Asn Gly Pro Pro Val Asn Ile
995 1000 1005

cac atc ccg caa att tag 3042
His Ile Pro Gln Ile
1010 



<210> 8
<211> 1013
<212> PRT
<213> Ashbya gossypii
<400> 8

Met Gly Asp Gly Ser Asp Ala Glu Arg Ser Gly Gly Thr Ser Ser Ser 
1 5 10 15

Ser Ala Leu Glu Leu Leu Ala Gln Tyr Glu Gln His Ile Met Glu Arg
20 25 30

Gly Arg Thr Leu Glu Ala Ile Glu Gly His Gly Gly Glu Arg Leu Gly
35 40 45

Pro Thr Tyr Glu Glu Leu Val Glu Glu Asn Val Gln Leu Arg Arg Glu
50 55 60

Leu Gln Gly Gln Arg Glu Glu Ile Glu His Leu Arg Lys Thr Ile Ser
65 70 75 80

Leu Leu Ala Ser Gly Arg Ser Gly Ala Thr Val Val Glu Gln Gln Val
85 90 95 

Arg Pro Glu Pro Ser Pro Ser Val Arg Glu Leu Ala Leu Pro Pro Arg 
100 105 110 

Ser Ala Asp Arg Arg Lys Asn Thr Lys Asn Leu Ser Leu Ala Pro Val 
115 120 125 

Gly His Glu Val Pro Ser Thr Asp Arg Leu Arg Val Ser Pro Gln Glu
130 135 140

Ala Thr Ser Gly Ala Gln Gln Val Pro Leu Leu Thr Ser Ser Lys Ser
145 150 155 160

Ala Glu Ile Leu Val Ser Lys Ser Pro Asp Glu Asp Arg His Leu Met
165 170 175

Ser Pro Arg Lys Thr Ile Ser Arg Ser Ser Ser Ser Tyr Ser Asn Thr
180 185 190

Leu Gly Ser Pro Ala Thr Ser Val Leu Tyr Lys Asn Ser Arg Ile Ser
195 200 205

Ile Thr Ser Pro Cys Lys Ser Asn Ser Thr Ser Lys Ala Ala Ser Val
210 215 220 

Leu Ser Leu Pro Glu Asn Asn Thr Ser Thr Glu Asn Ala Pro His Ser 
225 230 235 240 

Pro His Arg Ile Asp Asn Glu Leu Asp Leu Leu Thr Val Glu Pro Gln
245 250 255

Asp Gly Ser Arg Tyr Asp Thr Glu Arg Ala Gly Gly Pro Gly Pro Leu
260 265 270

Ser Pro Glu Ser Ile Val Tyr Ser Asp Ser Asp Leu Gln Glu His Gln
275 280 285 

Pro Ser Asp Leu Ser Ser Thr Thr Arg Thr Asp Leu Gly Lys Phe Arg 
290 295 300 

Asp Met Val Asp Thr Thr Phe Asn Ala Glu Asp Asn Pro Thr Gly Ser 
305 310 315 320 

Arg Asp Lys Glu Thr Gly Thr Glu Met Glu Ile Ala Thr Leu Gln Asn
325 330 335

Thr Pro Ser Arg Gln His Glu Ser Ser Leu Val Thr Ser Pro Gln Ala
340 345 350

Ser Arg Ser Ser Ile Thr Thr Pro Val Val Asp Pro Thr Asn Thr Ser
355 360 365

Glu Pro Ser Ser Leu Ser Ala Ala Lys Phe Gly Ser Met Ser Thr Ala
370 375 380

Thr Ser Ser Asn Lys Arg Ser Lys Gly Met Gly Thr Pro Ser Val Glu 
385 390 395 400 

His Ser Ala Lys Ser Tyr Ser Gln His Ser Gly Ser Pro His Ser Asn
405 410 415

Ser His Gln Ser Lys Lys Ala Asp Ile Pro Leu Phe Val Gln Pro Glu
420 425 430

Glu Leu Gly Thr Ile Arg Ile Glu Val Ile Ser Thr Leu Tyr His Glu
435 440 445

Pro Gly Asn Ala Ala Ser Ile Leu Phe Ser Val Val Asp Lys Lys Ser
450 455 460

Ser Lys Glu Met Phe Lys Phe Ala Lys Thr Phe Thr Arg Ile Ala Glu
465 470 475 480

Phe Asp Thr Phe Ile Arg Asn Asn Met Glu Ser Leu Ala Val Pro Pro
485 490 495 

Leu Pro Asp Lys His Met Phe Ala Ser Asn Val Pro Val Lys Val Asp 
500 505 510 

Ser Arg Arg Glu Lys Leu Asn Asp Tyr Phe Ala Ser Leu Leu Tyr Leu 
515 520 525 

Ser Pro Leu Pro Phe Asn Pro Ala Leu Lys Leu Ala Gln Phe Ile Ser
530 535 540

Thr Asp Pro Val Met Asn Pro Ile Thr Gly Glu Phe Ala Lys Glu Gly
545 550 555 560

Met Leu Leu Val Arg Lys Ser Lys Thr Leu Gly Ser Thr Thr Thr Trp 
565 570 575 

Arg Ile Arg Tyr Cys Thr Val Glu Gly Ser Ile Met His Leu His Asp
580 585 590

His Met Ile Asp Thr Asp Thr Ile Lys Leu Thr His Ser Thr Ile Glu
595 600 605

Leu Gln Ala Asn Leu Pro Asp Asp Lys Tyr Gly Thr Lys Asn Gly Phe
610 615 620

Ile Leu Asn Glu His Lys Lys Ser Gly Leu Ser Ser Ser Thr Lys Tyr
625 630 635 640

Tyr Phe Cys Ala Glu Thr Pro Lys Glu Arg Glu Gln Trp Ile Ser Val
645 650 655

Leu Thr Thr Leu Cys Asp Gly Pro Gly Gly Thr Ala Ala Ile Pro Ser
660 665 670

Ile Asn Ser Lys Ser Glu Ala Ser Ser Leu Phe Glu Gln Thr Ser Ile
675 680 685

Ser Asp Ser Ser Tyr Leu Gly Pro Ile Ala Asn Leu Glu Ala Met Asp
690 695 700 

Ala Thr Ser Pro Thr Arg Pro Asn Asp Pro Asn Pro Val Ser Leu Thr 
705 710 715 720 

Ser Glu Glu Glu Lys Glu Val Lys Arg Arg Arg Met Lys Ser Phe Phe 
725 730 735

Pro Phe Lys Lys Leu Ala Thr Thr Pro Thr Pro Tyr Ala Ala Gly Asn 
740 745 750 

Asp Asn Ala Ser Ile Phe Ser Gln Asp Asp Asp Ser Pro Val Asn Ala
755 760 765

Thr Asn Glu Ser Gly Ile Ser Arg Ser Leu Gln Ser Met Asn Leu Gln
770 775 780

Ala Gln Tyr Asn Ala Val Phe Gly Ala Asp Leu Arg Ser Cys Leu Gln
785 790 795 800

Leu Ser Ser His Pro Tyr Gln Gly Lys Tyr Glu Ile Pro Ser Val Val
805 810 815

Phe Arg Thr Leu Glu Phe Leu Tyr Lys Asn Arg Gly Ile Gln Glu Glu
820 825 830

Gly Ile Phe Arg Leu Ser Gly Ser Ser Ser Leu Ile Lys Ser Leu Gln
835 840 845

Glu Gln Phe Asp Lys Glu Tyr Asp Val Asp Leu Cys Asn Tyr Asn Asp
850 855 860

Lys Val Ser Val Thr Pro Gly Asn Glu Asn Gln Gly Gly Leu Tyr Val
865 870 875 880 

Asp Val Asn Thr Val Ser Gly Leu Leu Lys Leu Tyr Leu Arg Lys Leu 
885 890 895 

Pro His Met Ile Phe Gly Asp Ala Ala Tyr Met Asp Phe Lys Arg Ile
900 905 910

Val Glu Arg Asn Gly Asp Asp Ser Lys Leu Ile Ala Leu Glu Phe Arg
915 920 925

Ala Leu Val Asn Ser Gly Arg Ile Ala Lys Glu Tyr Val Ala Leu Met
930 935 940

Tyr Ala Leu Phe Glu Leu Leu Val Lys Ile Thr Glu Asn Ser Lys Tyr
945 950 955 960

Asn Lys Met Asn Leu Arg Asn Leu Cys Ile Val Phe Ser Pro Thr Leu
965 970 975

Asn Ile Pro Val Asn Ile Leu His Pro Phe Ile Thr Asp Phe Gly Cys
980 985 990

Ile Phe Gln Asp Lys Ala Pro Met Glu Asn Gly Pro Pro Val Asn Ile
995 1000 1005

His Ile Pro Gln Ile
1010 



<210> 9
<211> 530
<212> DNA
<213> Ashbya gossypii
<220>
<221> CDS
<222> (1)..(528)
<400> 9

cag gcc atg cat gaa ggg tta aat ata ata aaa att gac aac tgg cta 48
Gln Ala Met His Glu Gly Leu Asn Ile Ile Lys Ile Asp Asn Trp Leu
1 5 10 15

gaa gtg ata ccg cag ttg ata tcc cga att cac cag cct aac caa acc 96
Glu Val Ile Pro Gln Leu Ile Ser Arg Ile His Gln Pro Asn Gln Thr
20 25 30

gtg agt aga aca tta tta tct ctc tta tct gac ctc ggc aag gct cat 144
Val Ser Arg Thr Leu Leu Ser Leu Leu Ser Asp Leu Gly Lys Ala His
35 40 45

cct cag gct ctc gtc ttc cct cta aca gtt gct ata aaa tct gaa tct 192
Pro Gln Ala Leu Val Phe Pro Leu Thr Val Ala Ile Lys Ser Glu Ser
50 55 60

gta tct agg cag aga gct gct ttg tct att atg gag aag atg cgt atg 240
Val Ser Arg Gln Arg Ala Ala Leu Ser Ile Met Glu Lys Met Arg Met
65 70 75 80

cat agt tct aat ctg gtt gaa cag gca gaa ctg gtt agc aat gag ctc 288
His Ser Ser Asn Leu Val Glu Gln Ala Glu Leu Val Ser Asn Glu Leu
85 90 95 

att cgt att gct gtg ctg tgg cat gag cta tgg tat gaa ggt ctg gag 336
Ile Arg Ile Ala Val Leu Trp His Glu Leu Trp Tyr Glu Gly Leu Glu
100 105 110

gac gcg agt aga cag ttt ctc gga gag cat aat acg gaa aag atg ttc 384
Asp Ala Ser Arg Gln Phe Leu Gly Glu His Asn Thr Glu Lys Met Phe
115 120 125

gct act ttg gaa cca ctg cat gaa atg ttg aag agg gga cct gag act 432
Ala Thr Leu Glu Pro Leu His Glu Met Leu Lys Arg Gly Pro Glu Thr
130 135 140

cta cgg gag ata tca ttc cag aat tca ttt ggt aga gac ctg aat gac 480
Leu Arg Glu Ile Ser Phe Gln Asn Ser Phe Gly Arg Asp Leu Asn Asp
145 150 155 160

gca tat gaa tgg gtc atg aac tat aag agg aca cag gat atc agt aat 528
Ala Tyr Glu Trp Val Met Asn Tyr Lys Arg Thr Gln Asp Ile Ser Asn
165 170 175 

tt 530 



<210> 10
<211> 176
<212> PRT
<213> Ashbya gossypii
<400> 10

Gln Ala Met His Glu Gly Leu Asn Ile Ile Lys Ile Asp Asn Trp Leu
1 5 10 15

Glu Val Ile Pro Gln Leu Ile Ser Arg Ile His Gln Pro Asn Gln Thr
20 25 30

Val Ser Arg Thr Leu Leu Ser Leu Leu Ser Asp Leu Gly Lys Ala His
35 40 45

Pro Gln Ala Leu Val Phe Pro Leu Thr Val Ala Ile Lys Ser Glu Ser
50 55 60

Val Ser Arg Gln Arg Ala Ala Leu Ser Ile Met Glu Lys Met Arg Met
65 70 75 80

His Ser Ser Asn Leu Val Glu Gln Ala Glu Leu Val Ser Asn Glu Leu
85 90 95

Ile Arg Ile Ala Val Leu Trp His Glu Leu Trp Tyr Glu Gly Leu Glu
100 105 110

Asp Ala Ser Arg Gln Phe Leu Gly Glu His Asn Thr Glu Lys Met Phe
115 120 125

Ala Thr Leu Glu Pro Leu His Glu Met Leu Lys Arg Gly Pro Glu Thr
130 135 140

Leu Arg Glu Ile Ser Phe Gln Asn Ser Phe Gly Arg Asp Leu Asn Asp
145 150 155 160

Ala Tyr Glu Trp Val Met Asn Tyr Lys Arg Thr Gln Asp Ile Ser Asn
165 170 175 



<210> 11
<211> 402
<212> DNA
<213> Ashbya gossypii
<220>
<221> CDS
<222> (1)..(402)
<400> 11

gtg gac act tca ggc atg tcg aga gag acg cta cgg tac tac gaa ttt 48
Val Asp Thr Ser Gly Met Ser Arg Glu Thr Leu Arg Tyr Tyr Glu Phe
1 5 10 15

ctc tgt aga gtt gga gag gca aaa cgt tgg att gag gat gtg atc ggc 96
Leu Cys Arg Val Gly Glu Ala Lys Arg Trp Ile Glu Asp Val Ile Gly
20 25 30

gag acg ata cct gga gaa ctc gag ttg gca gct ggt aat tca atg cgc 144
Glu Thr Ile Pro Gly Glu Leu Glu Leu Ala Ala Gly Asn Ser Met Arg
35 40 45

gac ggc tat ttt ttg gcg aag gtc act caa acg att aaa cct gat ctt 192
Asp Gly Tyr Phe Leu Ala Lys Val Thr Gln Thr Ile Lys Pro Asp Leu
50 55 60

gca cct aca att gta cct cct ggt cgg ttg cag ttc aag cat aca cag 240
Ala Pro Thr Ile Val Pro Pro Gly Arg Leu Gln Phe Lys His Thr Gln
65 70 75 80

aat att aat gct ttt ttt tcg ctg atg gat ttg gta ggc gta ccg gac 288
Asn Ile Asn Ala Phe Phe Ser Leu Met Asp Leu Val Gly Val Pro Asp
85 90 95 

cta ttt cga ttt gaa ctg acc gac cta tac gag aag aaa gac gtt cca 336
Leu Phe Arg Phe Glu Leu Thr Asp Leu Tyr Glu Lys Lys Asp Val Pro
100 105 110

aaa gtt ttt gag act tta cat gca gtc gcg aac att ctc aat agt agg 384
Lys Val Phe Glu Thr Leu His Ala Val Ala Asn Ile Leu Asn Ser Arg
115 120 125

ttc ccc ggc gag att cct 402
Phe Pro Gly Glu Ile Pro

<210> 12
<211> 134
<212> PRT
<213> Ashbya gossypii
<400> 12

Val Asp Thr Ser Gly Met Ser Arg Glu Thr Leu Arg Tyr Tyr Glu Phe
1 5 10 15

Leu Cys Arg Val Gly Glu Ala Lys Arg Trp Ile Glu Asp Val Ile Gly
20 25 30

Glu Thr Ile Pro Gly Glu Leu Glu Leu Ala Ala Gly Asn Ser Met Arg
35 40 45

Asp Gly Tyr Phe Leu Ala Lys Val Thr Gln Thr Ile Lys Pro Asp Leu
50 55 60

Ala Pro Thr Ile Val Pro Pro Gly Arg Leu Gln Phe Lys His Thr Gln
65 70 75 80

Asn Ile Asn Ala Phe Phe Ser Leu Met Asp Leu Val Gly Val Pro Asp
85 90 95

Leu Phe Arg Phe Glu Leu Thr Asp Leu Tyr Glu Lys Lys Asp Val Pro
100 105 110

Lys Val Phe Glu Thr Leu His Ala Val Ala Asn Ile Leu Asn Ser Arg
115 120 125

Phe Pro Gly Glu Ile Pro
130 
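
The nucleotide entries above declare a CDS range under <222>, and the paired protein entries declare a residue count under <211>. A minimal Python sketch of a consistency check between the two is given here. The function and dictionary names are illustrative and not part of the listing, and the range used for SEQ ID NO:7 is an assumption (position 1 through the final tag codon at 3042), because its <222> line is not reproduced in this part of the listing.

```python
def expected_residues(cds_start, cds_end, has_stop=False):
    """Return the number of amino acids encoded by a CDS (1-based, inclusive)."""
    span = cds_end - cds_start + 1
    if span % 3 != 0:
        raise ValueError(f"CDS span of {span} nt is not a whole number of codons")
    codons = span // 3
    # A terminal stop codon occupies one codon but adds no residue.
    return codons - 1 if has_stop else codons


# name: (CDS start, CDS end, stop codon inside range?, declared protein <211>)
LISTING = {
    # Assumption: the range runs through the final "tag" codon at 3042.
    "SEQ ID NO:7 -> 8": (1, 3042, True, 1013),
    "SEQ ID NO:9 -> 10": (1, 528, False, 176),
    "SEQ ID NO:11 -> 12": (1, 402, False, 134),
}


def check_listing(listing=LISTING):
    """Map each entry to True when the CDS span matches the declared length."""
    return {
        name: expected_residues(start, end, stop) == declared
        for name, (start, end, stop, declared) in listing.items()
    }


if __name__ == "__main__":
    for name, ok in check_listing().items():
        print(name, "consistent" if ok else "MISMATCH")
```

Under these assumptions each pair is consistent: 528 / 3 = 176, 402 / 3 = 134, and 3042 / 3 = 1014 codons, one of which is the stop.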



<210> 13
<211> 20
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence:Primer
<400> 13

gctagggata acagggtaat 20

<210> 14
<211> 20
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence:Primer
<400> 14

aggcatgcaa gcttagatct 20

<210> 15
<211> 23
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence:Primer
<400> 15

gtttagtctg accatctcat ctg 23



<210> 16
<211> 21
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence:Primer
<400> 16

tcgcagaccg ataccaggat c 21

<210> 17
<211> 65
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence:Primer
<400> 17

aggaccacta gctcgttgcg ctgcaatata ataataagaa cgagagctag ggataacagg 60
gtaat 65



WO 00/22133 PCT/EP99/07501

<210> 18
<211> 65
<212> DNA

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence:Primer 
<400> 18 

aagtattcaa tcaactatgt gagtagtttc ttgtaggcag tctccaggca tgcaagctta 60 
gatct 65 

<210>19 
<211>65 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence:Primer 
<400> 19 

ctggcatcag aggaagctcc caccaccaag ctctacaaac acaaggctag ggataacagg 60 
gtaat 65 

<210>20 
<211>65 
<212>DNA 

<213> Artificial Sequence 






<220> 

<223> Description of Artificial Sequence:Primer 



<400> 20 

attatattag tatagtctaa agttgcaggc agtgggtatt aaagtaggca tgcaagctta 60
gatct 65



<210> 21 
<211>65 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence:Primer 
<400> 21 

acttgcgtac tctttcgcgt gctcgtcagc caccgaacaa cgcaggctag ggataacagg 60 
gtaat 65 



<210> 22 
<211>65 
<212>DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence:Primer



<400> 22 

ttaaagaatg ataaagaacc aaaaacacca cgagcttgca taacaaggca tgcaagctta 60 






gatct 65 



<210>23 
<211>65 
<212> DNA 

<213> Artificial Sequence
<220> 

<223> Description of Artificial Sequence:Primer 
<400>23 

gtgcgtgtca gcgagcatct aatcaagctg caaggcgccg gaaatgctag ggataacagg 60 
gtaat 65 



<210>24 
<211>65 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence:Primer 
<400> 24 

ttatcacata tttctaagtt aatagatatt tttacttagt atgaaaggca tgcaagctta 60 
gatct 65 



<210>25 



<211> 65
<212> DNA
<213> Artificial Sequence
<220> 

<223> Description of Artificial Sequence:Primer 
<400> 25 

gagagagacg ctacggtact acgaatttct ctgtagagtt ggagagctag ggataacagg 60 
gtaat 65 



<210>26 
<211>65 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence:Primer 
<400> 26 

tactattgag aatgttcgcg actgcatgta aagtctcaaa aacttaggca tgcaagctta 60 
gatct 65 



<210>27 
<211>65 
<212>DNA 

<213> Artificial Sequence 



<220> 



<223> Description of Artificial Sequence:Primer
<400> 27 

aaatataata aaaattgaca actggctaga agtgataccg cagttgctag ggataacagg 60 
gtaat 65 



<210>28 
<211>65 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence:Primer
<400> 28 

cctcttatag ttcatgaccc attcatatgc gtcattcagg tctctaggca tgcaagctta 60
gatct 65
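
Each 65-nucleotide primer above (SEQ ID NO:17 to 28) appears to end in one of the two 20-nucleotide primers, SEQ ID NO:13 or SEQ ID NO:14. That pattern would be consistent with a 45-nucleotide gene-specific segment fused to a common 3' segment, although the listing itself does not say so. The following Python sketch checks the observation directly. The sequences are copied from the listing; the helper names `clean`, `classify` and `tail_report` are illustrative only.

```python
# 20-mer primers from the listing
SEQ13 = "gctagggata acagggtaat"
SEQ14 = "aggcatgcaa gcttagatct"

# 65-mer primers from the listing, keyed by SEQ ID NO
LONG_PRIMERS = {
    17: "aggaccacta gctcgttgcg ctgcaatata ataataagaa cgagagctag ggataacagg gtaat",
    18: "aagtattcaa tcaactatgt gagtagtttc ttgtaggcag tctccaggca tgcaagctta gatct",
    19: "ctggcatcag aggaagctcc caccaccaag ctctacaaac acaaggctag ggataacagg gtaat",
    20: "attatattag tatagtctaa agttgcaggc agtgggtatt aaagtaggca tgcaagctta gatct",
    21: "acttgcgtac tctttcgcgt gctcgtcagc caccgaacaa cgcaggctag ggataacagg gtaat",
    22: "ttaaagaatg ataaagaacc aaaaacacca cgagcttgca taacaaggca tgcaagctta gatct",
    23: "gtgcgtgtca gcgagcatct aatcaagctg caaggcgccg gaaatgctag ggataacagg gtaat",
    24: "ttatcacata tttctaagtt aatagatatt tttacttagt atgaaaggca tgcaagctta gatct",
    25: "gagagagacg ctacggtact acgaatttct ctgtagagtt ggagagctag ggataacagg gtaat",
    26: "tactattgag aatgttcgcg actgcatgta aagtctcaaa aacttaggca tgcaagctta gatct",
    27: "aaatataata aaaattgaca actggctaga agtgataccg cagttgctag ggataacagg gtaat",
    28: "cctcttatag ttcatgaccc attcatatgc gtcattcagg tctctaggca tgcaagctta gatct",
}


def clean(seq):
    """Drop the grouping spaces used in the listing and lower-case the sequence."""
    return "".join(seq.split()).lower()


def classify(seq):
    """Return 13 or 14 if the sequence ends in that 20-mer, otherwise None."""
    s = clean(seq)
    if s.endswith(clean(SEQ13)):
        return 13
    if s.endswith(clean(SEQ14)):
        return 14
    return None


def tail_report(primers=LONG_PRIMERS):
    """Map each SEQ ID NO to (length, matching 20-mer SEQ ID NO or None)."""
    return {n: (len(clean(s)), classify(s)) for n, s in primers.items()}


if __name__ == "__main__":
    for seq_id, (length, tail) in sorted(tail_report().items()):
        print(f"SEQ ID NO:{seq_id}: {length} nt, 3' end matches SEQ ID NO:{tail}")
```

With the sequences as transcribed here, the odd-numbered primers end in SEQ ID NO:13 and the even-numbered ones in SEQ ID NO:14.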
1. [ ] Claims Nos.:

because they relate to subject matter not required to be searched by this Authority, namely:



2. [X] Claims Nos.: 16, 18-20

because they relate to parts of the International Application that do not comply with the prescribed requirements to such
an extent that no meaningful International Search can be carried out, specifically:

A meaningful search of claims 16 and 18 was not possible due to a lack of 
characterization of the claimed fungicidal compounds and inhibitors. Claims 19 
and 20, relating directly and exclusively to these compounds and their use, 
could not be searched either. 

3. [ ] Claims Nos.:

because they are dependent claims and are not drafted in accordance with the second and third sentences of Rule 6.4(a).



Box II Observations where unity of invention is lacking (Continuation of item 2 of first sheet)



This International Searching Authority found multiple inventions in this international application, as follows: 
1. [ ] As all required additional search fees were timely paid by the applicant, this International Search Report covers all
searchable claims.

2. [ ] As all searchable claims could be searched without effort justifying an additional fee, this Authority did not invite payment
of any additional fee.

3. [ ] As only some of the required additional search fees were timely paid by the applicant, this International Search Report
covers only those claims for which fees were paid, specifically claims Nos.:

4. [X] No required additional search fees were timely paid by the applicant. Consequently, this International Search Report is
restricted to the invention first mentioned in the claims; it is covered by claims Nos.:

Claims 1-20 (all partially)
FURTHER INFORMATION CONTINUED FROM PCT/ISA/210

Continuation of Box I.2
Claims Nos.: 16, 18-20



A meaningful search of claims 16 and 18 was not possible due to a lack of 
characterization of the claimed fungicidal compounds and inhibitors. 
Claims 19 and 20, relating directly and exclusively to these compounds 
and their use, could not be searched either. 

The applicant's attention is drawn to the fact that claims, or parts of 
claims, relating to inventions in respect of which no international 
search report has been established need not be the subject of an 
international preliminary examination (Rule 66.1(e) PCT). The applicant
is advised that the EPO policy when acting as an International 
Preliminary Examining Authority is normally not to carry out a 
preliminary examination on matter which has not been searched. This is 
the case irrespective of whether or not the claims are amended following 
receipt of the search report or during any Chapter II procedure. 
FURTHER INFORMATION CONTINUED FROM PCT/ISA/210

This International Searching Authority found multiple (groups of) 
inventions in this international application, as follows: 

1. Claims: 1-20, all partially

The GTP-binding protein-like genes from Ashbya gossypii with
nucleic acid sequences represented in seq.ID's 1 and 3,
encoding the respective protein seq.ID's 2 and 4, nucleic
acids or proteins substantially similar to said nucleic acids
and proteins, nucleic acids comprising at least 20
consecutive nucleotides from said sequences, chimeric genes
and vectors comprising said nucleic acids, host cell
transformed with said vector, process for producing said
protein using said host cell, process for identifying
inhibitors of said protein's activity, inhibitors identified
by said process, and method of suppressing growth of a
fungus by using said inhibitors.

2. Claims: 1-20, all partially 

The GTPase activating protein-like genes from Ashbya
gossypii with nucleic acid sequences represented in seq.ID's
5 and 7, encoding the respective protein seq.ID's 6 and 8,
nucleic acids or proteins substantially similar to said
nucleic acids and proteins, nucleic acids comprising at
least 20 consecutive nucleotides from said sequences,
chimeric genes and vectors comprising said nucleic acids,
host cell transformed with said vector, process for
producing said protein using said host cell, process for
identifying inhibitors of said protein's activity,
inhibitors identified by said process, and method of
suppressing growth of a fungus by using said inhibitors.

3. Claims: 1-20, all partially

The phosphatidylinositol-4-kinase-like gene from Ashbya
gossypii with nucleic acid sequences represented in seq.ID
9, encoding the respective protein seq.ID 10, nucleic acids
or proteins substantially similar to said nucleic acids and
proteins, nucleic acids comprising at least 20 consecutive
nucleotides from said sequences, chimeric genes and vectors
comprising said nucleic acids, host cell transformed with
said vector, process for producing said protein using said
host cell, process for identifying inhibitors of said
protein's activity, inhibitors identified by said process,
and method of suppressing growth of a fungus by using said
inhibitors.



4. Claims: 1-20, all partially

The cytokinesis-like gene from Ashbya gossypii with nucleic
acid sequences represented in seq.ID 11, encoding the
respective protein seq.ID 12, nucleic acids or proteins
substantially similar to said nucleic acids and proteins,
nucleic acids comprising at least 20 consecutive nucleotides
from said sequences, chimeric genes and vectors comprising
said nucleic acids, host cell transformed with said vector,
process for producing said protein using said host cell,
process for identifying inhibitors of said protein's
activity, inhibitors identified by said process, and method
of suppressing growth of a fungus by using said inhibitors.